Trading places


Scientists have a valuable part to play in clarifying the impacts of a proposed trade treaty between
the United States and Europe.


The world’s two biggest economies, the United States and the
European Union (EU), are negotiating an accord: the Transatlantic
Trade and Investment Partnership (TTIP). They argue
that it could boost the world economy by more than US$300 billion, 
and create millions of jobs. But at what price? 

Free-trade agreements have historically boosted economic growth 
by eliminating border tariffs and opening up markets. But TTIP is 
different. Few significant tariff barriers exist between the United States 
and the EU, so the proposed accord focuses on reducing the economic 
impact of ‘non-tariff barriers’ — or, in plain language, reforms of 
standards and regulations on everything from the environment and 
public health to agriculture and pharmaceuticals. These could have 
wide-ranging and profound impacts, for good and bad. 

There are clear benefits to be had from greater harmonization 
between the United States and the EU in some regulatory matters. 
Streamlining pharmaceutical regulation, for instance — so that an 
EU-approved product could be sold in the United States and vice versa 
— would reduce red tape and duplication. It would also boost research 
cooperation, cutting the costs of developing drugs and, ultimately, 
lowering prices. But other provisions of the accord could give drug 
companies more say in pricing and reimbursement policies — and so 
might result in higher medical costs and reduced access to health care. 
The humanitarian organization Médecins Sans Frontières (Doctors
Without Borders) has also expressed concern that the impact of free- 
trade agreements on intellectual property increasingly threatens the 
access of millions of people in poor countries to affordable medicines. 

As discussed on page 401, a major concern for scientific policy is 
that greater harmonization of EU and US policies might result in an 
overall decrease in regulatory standards on the environment, food 
safety or data protection, for example. Some say that such fears are 
overblown. Assessing what might be the true benefits and risks is not 
easy. Secrecy surrounds the negotiations, and the few details that are 
available usually come from unauthorized leaks. This is no way to 
conduct debate on such important matters. Civil society, including 
scientists and scientific organizations, must continue to badger for 
more access and information. At the very least, society should be given 
a similar level of access to that already granted to industry groups. 


EARTH INCORPORATED 

Beyond individual provisions, the nature of the proposed accord itself 
raises fundamental questions. Some leading economists — including 
Jeffrey Sachs at Columbia University in New York, previously a strong 
proponent of globalization, and Joseph Stiglitz, a 2001 economics Nobel 
laureate, also at Columbia — have called for TTIP to be rejected. They 
argue that it is not a trade agreement at all, but an undemocratic way for 
corporate lobbies to impose a narrow form of globalization — one that is 
focused more on commercial interests than on creating global systems 
to address such challenges as environmental and health crises (including
biodiversity loss) and reducing economic inequality.

One proposed provision — a mechanism for settling disputes that is 
designed to protect investors — deserves much wider debate. It would 
allow companies to bring lawsuits against sovereign states in private 
arbitration courts to obtain financial compensation for any regulatory 
actions that could harm their ‘expected’ profits. Similar mechanisms 
have already allowed the tobacco company Philip Morris to sue Uruguay
over the introduction of national anti-smoking measures. And Swedish
energy company Vattenfall sued the German government in 2009 for
introducing stricter environmental controls on coal-fired power plants;
that case was settled out of court in 2010. The proposals mean that
France, for example, might open itself up to lawsuits over its ban on
fracking or its phasing out of a class of pesticides thought to harm
bees and other insects.

The existence of such a legal mechanism at the highest levels of US 
and EU politics could have a dangerous and chilling effect, making 
countries think twice before introducing stricter regulations to protect 
public health or the environment, for example. It offers a ready-made 
route for corporate interests to usurp the right of sovereign states to act 
in the best interests of their citizens. In February, the European Univer- 
sity Association expressed concern that TTIP could harm the ability of 
authorities to determine the shape of their higher-education systems. 

The use of sound science to set regulations that affect trade is to be 
encouraged. But the science is not always unequivocal, and it must 
by no means be the only consideration. The practices of individual 
nations are forged from their own history and culture, resulting in dif- 
ferent approaches to how they structure health care, agriculture, food 
or environmental systems — and in how these are shaped by govern- 
ment and the market, and to what extent. National attitudes to science 
and technology are formed in a similar way; for example, in the level of
risk people are willing to accept, or the ethical limits that such attitudes 
place on research or medical practices. 

That is why regulations differ worldwide. The dominant globaliza- 
tion ideology too often sees cultural differences, and citizens’ rights to 
determine the sort of society they desire, as trade barriers that must be 
overcome. This is misguided. Science policy-setting is not, and must 
not be, chiefly about trade. 

Done right, the harmonization of regulations and standards 
between the United States and the EU could bring enormous benefits. 
But the potential risks it carries for increased deregulation, and for the 
role and diversity of science-related policies in democratic societies, 
deserve much greater scrutiny, and much more transparent debate.
Scientists, social scientists and their representatives, such as national 
academies, could perform a public service by contributing to and help- 
ing to clarify the many detailed scientific, technical, regulatory and 
social aspects of this complex and wide-ranging accord.
Wakey wakey


Sleeping-beauty papers offer hope that
authors of uncited works are in good company. 


How many of the research papers published in this week’s
Nature, or indeed any scientific journal, will go on to
change the world? How many will at least make a sizable dent
in their academic field, inspire future work and perhaps overturn what 
has gone before? Citations by other researchers are the currency of 
modern science, a mark of professional approval that indicates influ- 
ence. But how long should one be expected to wait for them? Conven- 
tional wisdom says that the reach of a publication can be gauged by 
how many citations it attracts in the first five years. Which gives this 
week's authors until 2020 or so. 

Vincent Van Gogh had no time for conventional wisdom. The 
artist was famously ignored in his lifetime, yet his work L’Allée des
Alyscamps sold for US$66 million in New York earlier this month. The 
books of Herman Melville were out of print when he died, and 1851's 
Moby-Dick did not surface in the public consciousness until years 
later. Art and science are not so different. Gregor Mendel was not 
recognized as the father of modern genetics until decades after his 
experiments with peas. 

Such cases sound extreme, but a study offers the presently unloved 
hope that delayed recognition might not be so rare after all. After 
analysing a database of 22 million academic papers, researchers have 
identified plenty of works that went for decades before they were rec- 
ognized and cited as important (Q. Ke et al. Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/pnas.1424329112; 2015). One paper — 
entitled ‘Concerning adsorption in solutions’ — lay undisturbed for 
almost a century. Its author, the German chemist Herbert Freundlich, 
published the work in 1906 and died in 1941. His citation spike came 
in 2002. 


The apparent snub did little to damage his career — Freundlich 
went on to be widely recognized (and cited) as a pioneer of colloids 
research. And an initial lack of citations for a 1935 paper on quantum 
mechanics did not reflect the standing of its authors: physicists Boris 
Podolsky, Nathan Rosen and one Albert Einstein. Only in 1994 did 
this publication start to be cited extensively. 

Scientific papers typically accrue citations steadily, peak and then
decline. Those that at first lie dormant, before being discovered and
enjoying a late surge, are dubbed sleeping beauties. In many cases,
the awakening comes when the published research finds applications in
a different field, such as when statistical methods acquire a use in
biology. Some papers were ahead of their time, and described techniques
that could not be exploited properly until the creation and curation of
large modern databases.

But creators of sleeping papers be warned: there is no guarantee that 
your prince will come. Although the latest study indicates that late- 
flowering papers are more common than previously thought, there 
remain plenty that are never cited — and never will be. 

Filippo Radicchi, a researcher in complex networks at Indiana Uni- 
versity Bloomington who worked on the study, says: “I expect, if you 
look at a paper that is 10 years old [and not cited] my guess is it will 
continue to have zero citations forever” (see Nature http://doi.org/4tb; 
2015). He is now trying to identify the papers that wake the dormant 
studies from their slumber with an important citation. 

Citation analysis is an increasing fact of academic life, and this study 
demonstrates, yet again, that the bare figures do not — and can never 
— show the full story. Some impact, and personal achievement, is 
simply difficult to measure, even during a productive career. 

Herbert Freundlich achieved much in life, and more after his death. 
But he did not get everything he wanted. As his obituary in Nature 
noted, he was a talented but frustrated musician. “He abandoned
music for chemistry,” it said, “when he concluded that he would never
be a great composer”. There is still time, Herbert.


Silicon smarts 


A package of articles in Nature assesses the 
state of artificial-intelligence research. 


When a select band of computer scientists met at Dartmouth
College in Hanover, New Hampshire, in 1956 to begin
work on a field they called ‘artificial intelligence’, they were
optimistic, to say the least. Their founding principle of developing 
machine intelligence was based on an assumption that human intel- 
ligence could itself be well characterized. They argued that: “Every 
aspect of learning or any other feature of intelligence can in principle 
be so precisely described that a machine can be made to simulate it.” 
Ask ten people to define human intelligence and you will get at least 
eleven answers. To a philosopher, intelligence is the absence of a lack
of intelligence. To psychologists it is what intelligence tests measure. 
Yet despite this fuzziness, the nature of artificial intelligence, in pop- 
ular culture at least, is sharply defined: computers and robots that can 
think and act like a human, and that have the potential to outthink and 
counteract us in most situations. That is probably why many people are 
disappointed with what even the most advanced robots can achieve, 
certainly compared with the impressive abilities of even the young- 
est humans. In their minds, Mozart was composing and performing 
music at five years old whereas robots can barely fold a towel. The 
pre-eminence of humankind, it seems, is assured. 
And yet, break down the holistic expectation of intelligence into a
series of distinct (if overlapping) abilities, and the machines fare some- 
what better. In a research paper on page 503, scientists define intelli- 
gence as the ability to predict the future. And they have built machines 
that can do it pretty well. Or at least they have built robots that can 
analyse the past to plan how to modify their own future behaviours 
if they are to continue functioning. The work’s implications for the 
continuing survival of feeble humanity are described in a News & 
Views article on page 426. 

Continuing the theme, a series of Comment articles starting on 
page 415 assesses the current state of debate over how society should 
respond, regulate and interact with intelligent machines. From 
autonomous weapons, which could be ‘clever’ enough to distinguish 
friend from foe and act accordingly, to medical diagnoses based on 
rapid and accurate analysis and interpretation of health-care data, 
these machines may not yet be classed as fully intelligent, but they 
are reaching a point at which they can mimic and potentially outperform
specific ‘intelligent’ human abilities. What should be done?
In the case of drones and other armed intelligent machines, decision 
time is looming. 

Finally, a string of Review articles make up a Nature Insight on 
machine intelligence, starting on page 435. From machine-learning 
techniques and evolutionary computation to the design and construc- 
tion of malleable robots inspired by nature, the 
selection offers both a primer to the uninitiated 
and a useful summary of the state of the art. It is 
all, of course, essential reading. The machines, 
after all, are getting smarter. We should keep up.
WORLD VIEW


As the world searches for a more sustainable future for its
growing population, there is increasing interest in getting
more people to eat insects. Earlier this month, former United
Nations secretary-general Kofi Annan told The Guardian newspaper 
that “eating insects is good for the environment and balanced diets”. 
This backed the view of a widely cited 2010 report from the UN Food 
and Agriculture Organization that stressed insects’ “exceptional 
nutritional benefits” and “fewer negative environmental impacts” 
when compared with “many mainstream foods” (see
go.nature.com/6ln9dw).

So far, there is little sign that these messages are being heeded. Turn- 
ing Westerners into insectivores has joined the long list of challenges 
that require behaviour change. As such, it holds broader lessons for 
other attempts to convert people to more sustain- 
able lifestyles, and is a useful case study. 

Many people already eat insects. There is 
evidence that insects have been a continuous 
part of our diet since the early hominins, and 
they are still eaten widely in south and east 
Asia, Africa, and South and Central America. 
(Although the practice is declining in some 
places as people switch to ‘aspirational’ Western 
lifestyles.) 

Western policy-makers and the media 
often send messages that lean heavily on this 
argument: they do it, so why can’t we? These 
messages use rational reasons to try to overturn 
the presumed major psychological objection to 
eating insects — disgust. This strategy assumes 
that the revulsion people feel when presented 
with, say, house-fly pupae (protein content 
62%!), is a cognitive process that can be addressed through education. 

There is little evidence that this strategy is working. Worse, appeal- 
ing to reason and responsibility reinforces a dilemma in the minds of 
consumers: many know that they can, in principle, eat insects, and 
perhaps that they should, but very few are willing to do so. 

What if it is not disgust that stops people eating insects? Disgust is 
linked to contamination and fear of disease. Insects are considered 
disgusting to eat, the theory says, because insects themselves eat ‘dirty’ 
foods. Yet many Westerners are happy to consume lobsters, which
scavenge from the sea floor, and pigs, which eat slops. Many insects, 
including some grasshoppers and ants, have the same diet as sheep. 

We should think less about combating disgust and more about 
appealing to taste. Most of the insects eaten in the world are cooked as 
part of interesting preparations that make them 


Eat insects for fun, not to
help the environment 


Insects are an excellent source of sustainable protein, but people will only 
be persuaded to eat them if they seem appealing, says Ophelia Deroy.


Taste is affected by more than the flavour and smell of food. Also 
important is colour, the other visual images associated with the food 
and the name it is presented under. The re-naming of the (rather 
ugly) Patagonian toothfish as Chilean sea bass, for example, led 
to a sharp increase in sales. In one of the few studies to have been 
conducted so far, Belgian consumers were shown to accept insects 
(mealworms and house crickets) more readily when they were prepared
using familiar flavours (R. Caparros Megido et al. J. Sens. Stud. 29,
14-20; 2014).

Chefs and others who work intimately with food know the impor- 
tance of perception. I would never have eaten crickets if it was not 
for the beautiful golden dust that the chef sprayed them with before 
placing them on the side of a salad. It made them look unusual, 
and rare, but also made me anticipate a firm 
crunchy texture and a sharp taste. These expec- 
tations made the experience of eating them 
much less scary, and really enjoyable. And I do 
not remember it as ‘doing something good for
the environment’ or ‘eating an insect alternative
to animal protein’.

If we are serious about broadening the appeal 
of insects as food — and we should be — then the 
images we present to consumers should not be of 
industrially farmed meat substitute. 

Telling citizens and consumers that the food 
industry will have licence to add insect matter 
to food products will make them worried and 
sceptical. What is insect matter after all — which 
insects? How were they grown? Food is a ques- 
tion of trust. Insects that are farmed industrially, 
or imported from long distances by road, air or 
sea, might not be such a great help for the environment. We should 
focus instead on local insects. Studies suggest that people will be more 
likely to try a bee ice cream from their local honey producer than a 
stick of grilled scorpion from far away. 

We must stress the importance of cooking and recipes. And we 
should even move on from using the too-broad term ‘insects’. We do
not eat mammals and birds; we eat cows, sheep and chickens. Insects 
could be identified by their real names, such as house crickets and 
wax-moth larvae. And we should not forget those with a sweet tooth: 
many insects lend themselves naturally to desserts. 

Most importantly, before we try to change the minds of consumers, 
we must understand their objections. And to overcome these objec- 
tions, food scientists, chefs and psychologists must work together to 
make insect dishes appeal as food, not as a way to save the planet.


Ophelia Deroy is a researcher in the Centre for the Study of the Senses 
at the School of Advanced Study, University of London. 
e-mail: ophelia.deroy@sas.ac.uk 
CLIMATE SCIENCE


Less sea ice, more 
Siberian snow 


Shrinking Arctic sea ice leads 
to heavier snows in western 
Siberia. This is some of the 
first evidence for how low 
sea-ice levels in the Arctic 
autumn affect precipitation 
in neighbouring regions the 
following winter. 

A team led by Martin 
Wegmann at the University 
of Bern, Switzerland, looked 
at snow measurements from 
820 locations across Russia, 
and modelled how moisture 
flows between the ocean 
and the atmosphere. They 
found that the Barents and 
Kara seas, north of the Ural 
Mountains, released more 
moisture into the air when 
the seas were not covered 
with ice. That moisture then 
ends up as snow in northern 
Russia. 

If sea ice continues to 
dwindle as expected, Eurasia 
and other continents may 
need to brace for heavy winter 
snowfalls. 

Environ. Res. Lett. 10, 054015
(2015)


ANIMAL BEHAVIOUR 


Crafty crows keep 
their tools handy 


A crow species renowned for 
its use of tools has a time- 
saving trick — stashing the 
same tool for future use. 

New Caledonian crows 
(Corvus moneduloides) invest 
much time and energy turning 
sticks into hooks to extract food 
from small cavities. To see what 
happened to the tools when 
they were not in use, Barbara 
Klump and Christian Rutz at 
the University of St Andrews, 
UK, and their colleagues 
offered crows food hidden in 
holes in a block of wood. They 
found that, while they ate, the 
crows usually pinned their 
tool underfoot (pictured) or 
stashed it in a hole between 
uses. Crows foraging up high 
— where a fallen tool would be 
harder to retrieve — were more 
apt to store tools in holes than
were birds lower down.

Such safekeeping probably 
mitigates the cost of tool- 
making, allowing this complex 
behaviour to evolve, the 
authors say. 

Proc. R. Soc. B 282, 20150278 
(2015) 


CANCER GENETICS 


Skin riddled with 
cancer mutations 


More than one-quarter of 
cells in ageing, sun-exposed 
skin carry mutations that are 
known to drive cancer — even 
though the skin continues to 
function normally. 

To understand how healthy 
cells can mutate to form 
cancers, Philip Jones of the 
University of Cambridge, 
UK, Peter Campbell of the 
Wellcome Trust Sanger 
Institute in Hinxton, UK, and 
their colleagues sequenced 
74 cancer-linked genes in 
biopsies taken from excess 
eyelid skin removed from 
four healthy people aged 
over 55. More than 25% 
of the normal cells each 
contained thousands of 
cancer-causing mutations 
caused by sunlight — a 
similar frequency to that 
seen in many skin-cancer 
cells. These cells might form a 
reservoir with the potential to 
transform into tumours. 

Cancer drugs that target 
these mutations could 
damage normal cells with the 
same mutations, the authors 
suggest. 

Science 348, 880-886 (2015) 


METABOLISM 


Potential obesity 
drug from vine 


A compound found in the
roots of the ‘thunder god’ vine
could be a weight-loss drug, a
study in mice suggests.

Umut Ozcan of Boston
Children’s Hospital in
Massachusetts and his
colleagues used a database
of gene expression in
human cells treated with
various molecules to look
for compounds that could
increase the brain’s sensitivity
to leptin, a hormone
that suppresses appetite.
The screen fished out a
compound called celastrol,
found naturally in the
roots of thunder god vine
(Tripterygium wilfordii). Its
skinned roots are used in
traditional Chinese medicine
to treat inflammation.

The team found that obese
mice treated with celastrol ate
less than untreated obese mice,
and they lost up to 45% of
their body weight. Genetically
engineered animals that lacked
normal leptin responses did
not lose weight, however.
Treatment of obese mice with
celastrol also activated leptin
signalling and reduced blood-
sugar levels, with no obvious
signs of toxicity.

Cell http://doi.org/4sm (2015)


MATERIALS


Printed spider
webs get tough


Using a 3D printer,
researchers have created
spider-web analogues out of
elastic polymer threads and
tweaked their architectures to
maximize the webs’ strength.

Markus Buehler at the
Massachusetts Institute of
Technology in Cambridge
and his colleagues printed
synthetic webs (pictured)
and added various sizes of
load to test the webs’ strength.
The authors found that they
could strengthen the web by
adjusting the diameter of the
threads radiating out from the
middle, and that of the threads
that spiral around the web.
They did so while keeping the
mass and general geometry of
the web constant.

Strong synthetic webs
could be used in applications
such as reinforcing industrial
materials, the authors say.
Nature Commun. 6, 7038 (2015)


SOCIAL SELECTION


Popular topics
on social media


Genomics paper probed on Twitter


A recent Twitter conversation casting doubt on the
conclusions of a genomics study has revived a debate about
how best to publicly discuss possible errors in research. Yoav
Gilad, a geneticist at the University of Chicago in Illinois,
last month wrote on Twitter that fundamental errors in
the design and data analysis of a December 2014 study led
to an unfounded conclusion about the genetic similarities
between mice and humans. Gilad and his co-author
Orna Mizrahi-Man, a bioinformatics researcher also at
the University of Chicago, have since detailed their data
reanalysis in the open-access journal F1000Research (in
which articles are openly peer-reviewed after publication).
Michael Snyder, a geneticist at Stanford
University in California and co-author
of the original paper, stands by his team’s
study and its conclusions.

F1000Research 4, 121 (2015)




28 MAY 2015 | VOL 521 | NATURE | 397 


© 2015 Macmillan Publishers Limited. All rights reserved 


SEVEN DAYS


Animal drug trail 


Drug companies that sell 
antibiotics to US livestock 
farmers may be required
to track where their drugs
are going. Under a new rule
proposed by the US Food 
and Drug Administration on 
19 May, drugmakers would 
have to collect data on how 
their drugs are being sold and 
used in different livestock. 
The agency plans to use the 
data to study how antibiotic 
usage in animals leads to the 
development of antibiotic- 
resistant bacteria. It is accepting 
comments on the rule until 
August. 


EVENTS 


LHC breaks record 


The Large Hadron Collider 
(LHC) smashed protons 
together at a record energy
of 13 teraelectronvolts on
20 May. After a two-year hiatus,
researchers at CERN, Europe's 
particle-physics laboratory 
near Geneva, Switzerland, are 
gearing up to start taking data 
at this energy in early June. The 
latest collisions were designed 
to calibrate the systems that 
protect the machine and 
detectors from stray particles. 
Although the LHC’s detectors 
were not fully operational 
during the tests, experiments 
including ALICE captured 
the first bursts of particles 
produced by collisions at this 
energy. 


Mystery mission
The US Air Force launched
its uncrewed spaceplane, the
X-37B, from Cape Canaveral,
Florida, on 20 May for a
secretive classified test. It is
the fourth flight of the vehicle,
which can remain in low-
Earth orbit for more than a
year, much longer than
the space shuttle could. The
vehicle's research purposes
include testing the effects of the
harsh space environment on
various materials. The launch
also sent ten miniature satellites
into orbit, as well as a privately
funded solar sail, “LightSail-


California deals with massive oil spill


An oil spill from a broken pipeline near
Refugio State Beach in California, which has
contaminated 14 kilometres of coastline,
led the state’s governor to declare a state
of emergency in Santa Barbara County on
20 May. Oiled birds and mammals have been
recovered, some still alive, as government and
private workers try to determine the cause of
the leak and contain the damage. According
to Plains All American Pipeline, the company
in Houston, Texas, that operates the pipeline,
a maximum of 2,400 barrels have leaked.
Around 500 barrels’ worth had been recovered
by the end of 24 May.


Surgeon misconduct
Paolo Macchiarini, famed
for transplanting artificial
windpipes into patients,
committed scientific
misconduct in six papers
concerning the procedure,
according to an independent
investigator. Bengt Gerdin at
Uppsala University in Sweden
found that Macchiarini’s papers
made the operation sound
more successful than it was.
The Karolinska Institute in
Stockholm, where Macchiarini
is a visiting professor, asked
Gerdin to examine allegations
of scientific misconduct
against Macchiarini after four
physicians who had cared for
three of the trachea recipients
filed complaints. Macchiarini
now has two weeks to formally
respond to the report. See page
406 for more.


RESEARCH


Gibbon rescue plan


A plan to save the world’s rarest
primate, which resides on
China's Hainan Island, has been
produced by an international
team led by the Zoological
Society of London. The plan,
released on 19 May, cites a need
to limit human activity in the
20 square kilometres of forest
that contain the last 25 Hainan
gibbons (Nomascus hainanus),
as well as increased monitoring
and habitat improvements. If
it becomes extinct, the gibbon
would be the first ape wiped
out by human activity: much
of its decline from numbers
of around 2,000 in the 1950s
has been driven by habitat loss
from logging (see Nature 508,
163; 2014).


Dolphin deaths 

The deaths of more than
1,300 bottlenose dolphins
beached in the northern Gulf
of Mexico since early 2010
may be related to the massive
Deepwater Horizon oil spill,
according to a study published
on 20 May (S. Venn-Watson
et al. PLoS ONE 10, e0126538;
2015). The spike in dolphin
deaths began shortly before the
spill in April 2010, which has
perplexed scientists. The study
found that the dead animals
had a much higher incidence
of lung and adrenal lesions,
consistent with exposure to
petroleum, compared with
dolphins that were stranded
before the spill or in different
areas. See go.nature.com/ibmbxr
for more.


Imperilled EU birds 


Loss and degradation of natural
habitats such as grasslands
and wetlands is threatening
32% of about 450 wild bird
species common in Europe.
The most extensive report
yet in the series The State of
Nature in the European Union, 
by the European Environment 
Agency, finds that species 
including once-common 
farmland birds such as the 
skylark (Alauda arvensis) are at 
risk of extinction or in critical 
decline. Over-grazing and 
widespread use of fertilizers 
and pesticides are among the 
main threats to bird habitats. 
However, thanks to targeted 
conservation, about half of wild 
bird species are deemed secure. 


Amazon railroad
Brazil and China agreed on
19 May to launch feasibility
studies on a transcontinental
railroad that would cut
transportation costs for soya
beans and other exports.
China, Brazil and Peru
signed a memorandum of
agreement last year on the
5,300-kilometre railroad,
which would cut through the
Amazon and connect ports on
the Atlantic and Pacific coasts.
Chinese Premier Li Keqiang
(pictured, centre) made
the ‘Twin Ocean Railroad’,
estimated to cost around
US$10 billion, a cornerstone
of his visit to Latin America.
Activists are already raising
concerns about impacts on
the rainforest as well as on
indigenous tribes.


TREND WATCH


More papers with huge lists of
authors have appeared in the past
few years, according to data from
Thomson Reuters. Authorships
of more than 1,000 were very
rare in the early 2000s, but by
2011 more than 130 such papers
had been published. The size
of these multi-contributor lists
has also increased, reaching the
record-breaking figure of 5,154
this month for a paper in Physical
Review Letters by two teams at
Europe’s particle physics lab,
CERN (see Nature http://doi.org/4sn;
2015).


Retraction request 


The senior author on a much-publicized
Science paper on
changing people’s views about
same-sex marriage asked for
it to be retracted on 19 May.
Donald Green at Columbia
University, New York, made the
request after other researchers
spotted irregularities in data
provided by his co-author
Michael LaCour, a graduate
student at the University
of California, Los Angeles.
The study found that short
conversations with a canvasser
who is gay could encourage
people to support same-sex
marriage. Science posted an
‘expression of concern’ online
on 20 May, noting that the
request was made “because of
the unavailability of raw data
and other irregularities”.


Espionage arrests 


Two professors are among 
six Chinese men who were 
charged with economic 
espionage by the US 
Department of Justice on
19 May. They are accused of
stealing information from US 
companies about an acoustic 
chip that has consumer 
electronics and military 
applications. On 16 May, one 
of the accused, Zhang Hao, 
an engineering professor at 
Tianjin University in China, 
was arrested in Los Angeles, 
California, as he got off a 
flight to attend a conference. 
This follows other recent
cases in which the United
States has accused scientists
of economic espionage for
the benefit of China.


MEGA-AUTHORSHIPS SOAR
The number of papers with vast contributor lists has climbed in recent
years, as has the size of these gargantuan authorships.
[Bar chart: number of papers with more than 1,000 authors, by year, 2002–2014. Annotations: most authors per paper in 2000: 860; in 2012: 3,113; in 2015 so far: 5,154.]


30 MAY–2 JUNE
The latest microbial
research is presented to
about 8,000 researchers
from around the world
at the meeting of the
American Society for
Microbiology in New
Orleans, Louisiana.
Topics include
biomedicine, food safety
and environmental
microbiology.
go.nature.com/bqdgs4


1–5 JUNE
Physicists from a range
of fields meet at the Kavli
Institute for Theoretical
Physics at the University
of California, Santa
Barbara, for the Closing
the Entanglement Gap
conference. It focuses on
how entanglement can
elucidate problems in
quantum information,
holography and
condensed matter
systems.
go.nature.com/nrk7xe


John Nash dies 

US mathematician John Nash 
and his wife Alicia were killed 
in New Jersey on 23 May when 
their taxi was involved in an 
accident. Nash won the 1994
Nobel economics prize for a
breakthrough he had made
in 1950 on game theory. In
March, he was awarded an
Abel Prize for his contributions
to the theory of nonlinear
partial differential equations
and its applications to
geometric analysis. For much
of his life, Nash, who was 86,
struggled with schizophrenia,
an experience portrayed in the
2001 Oscar-winning film
A Beautiful Mind.


INTERNATIONAL LAW


Activists campaign against a trade treaty that they fear will allow US chlorine-washed chicken into the EU. 


Trade treaties 
centre on science 


Proposed deals have potential to boost research, but also to 
weaken health and environmental protections. 


BY DANIEL CRESSEY 


Two treaties that would govern most of
the world’s trade, and change how
nations across the globe use scientific
evidence to craft regulations, inched closer
to fruition this week. On 22 May, the US Sen- 
ate approved legislation that could speed up 
approval of the Transatlantic Trade and Invest- 
ment Partnership (TTIP) and the Trans-Pacific 
Partnership (TPP). 
Public attention has focused on the
economic impact of the treaties: TTIP is
between the United States and the European 
Union (EU) and the TPP is between a variety 
of North American and Asian nations (includ- 
ing the United States, Mexico, Japan, Australia 
and Malaysia, but not China or India) whose 
economies together account for around 60% 
of the world’s gross domestic product. But 
researchers, industry groups and non-gov- 
ernmental organizations (NGOs) are urging 
scrutiny of the treaties’ impact on science. 
Supporters point to the agreements’
potential to boost research in pharmaceuticals.
Critics say that the treaties could undermine
countries’ abilities to protect the health of their
citizens, as well as to cleave to nation-specific 
comfort levels with genetically modified (GM) 
organisms. 

“TTIP really falls squarely within the
domain of science and technology policy,”
says Sebastian Pfotenhauer, a science-policy
researcher at the Massachusetts Institute of
Technology in Cambridge. “What is at stake
is the sovereignty of countries to interpret
scientific data and regulate risks in the way
they choose.”

In the case of TTIP, tariffs between the EU 
and United States are already so low that the 
benefits to business are expected to come 
mainly from harmonization of regulations, 
which differ significantly between the two 
blocs. The EU is sceptical of GM crops, limit- 
ing what can be grown and imported there. In 
the United States, by contrast, the crops are 
widespread and are generally accepted by the 
public. However, the United States has tougher 
standards in other areas of trade: for example, 
it heavily restricts beef purchases from the EU 
because of fears over BSE (bovine spongiform
encephalopathy), also known as mad cow disease.


RACE TO THE BOTTOM 

Some groups fear that agreements such as 
TTIP could be used to force all nations to 
adopt the rules of the country with the least 
regulation in each field. “It sets up a series of
regulatory steps that will undoubtedly create
a race to the bottom,” says David Azoulay, an
attorney with the Center for International
Environmental Law (CIEL) in Geneva, Switzerland.
“We believe that there are basically
no benefits that will come from TTIP for people
or the environment,” adds Fabian Flues, a
campaigner for Friends of the Earth Europe 
in Geneva. CIEL notes that the agrochemical 
industry is already considering using TTIP 
to increase the acceptable level of pesticide 
residues on food by pushing for the higher US 
threshold to be adopted in the EU. 

Others dismiss the idea of a ‘race to the
bottom’. The European Commission says that
harmonization need not lower regulatory pro- 
tection. And Alberto Alemanno, a lawyer at 
the business school HEC Paris who has done 
consultancy work for the EU’s TTIP negotiating
team, says that TTIP aims to create a system
that allows regulators to discuss when
their standards are equivalent, which
does not necessarily mean making all regu- 
lations identical, as many people assume. 
If regulations are harmonized in some 
instances, he points out, citizens would be 
able to push for the higher standard — not 
the lower — to be adopted. 

Greater clarity on how medicines will 
be treated by regulators would be a boon 
for pharmaceutical research, says Richard 
Bergström, director-general of the European
Federation of Pharmaceutical Industries and 
Associations in Brussels. It could also mean 
that drugs reach patients earlier, because 
companies would no longer have to apply 
separately to US and EU regulators, a process 
that can involve costly and time-consuming 
tests, especially for
paediatric medicines.
“I’m both surprised
and quite frustrated
by the debate around
TTIP,” he says.

Another fear surrounding
the trade agreements is that if
they are ratified they
will include a legal framework known as an 
investor-to-state dispute resolution mecha- 
nism, which would allow companies to take 
governments to court and overturn legisla- 
tion. A company could thus limit a govern- 
ment’s ability to regulate a certain chemical 
on health grounds if that were seen as an 
unreasonable restriction of the company’s 
trade as laid out in the treaty. Objectors to 
the treaties fear that a dispute provision 
would also discourage governments from 
passing strict legislation in the first place. 

The history of the North American Free 
Trade Agreement (NAFTA) and other such 
agreements is littered with such cases, says 
Pfotenhauer. The pharmaceutical firm 
Eli Lilly is currently suing Canada under 
NAFTA after the government invalidated 
patents on two of the company’s drugs in 
an argument over interpretation of clinical- 
trial data. Companies have also tried to use 
NAFTA to force Canada to allow toxic- 
waste exports and to include an additive in 
petrol that the nation’s regulators claimed 
was dangerous. 

These cases revolved around the 
interpretation of scientific knowledge, 
notes Pfotenhauer. Indeed, one thing that 
NGOs, lawyers and governments agree 
on is that scientists should care about the 
trade deals. “TTIP is set to change the way
in which scientific cooperation is set to take 
place across the Atlantic,” says Alemanno. 

And there may be little point in fighting 
the deals. Against a background of gloom 
and pro-free-trade governments, says Alemanno,
even if this particular trade negotiation
breaks down, as a philosophy, “TTIP
is here to stay”. SEE EDITORIAL P.393


DRUG DEVELOPMENT


Bacterial arms 
race revs up 


With antibiotic resistance on the rise, researchers are 
looking for new ways to treat infections. 


BY SARA REARDON 


More than eight decades have passed
since Alexander Fleming’s discovery
of a fungus that produced penicillin,
a breakthrough that ultimately spawned
today’s multibillion-dollar antibiotics indus- 
try. Researchers are now looking to nature 
with renewed vigour for other ways of fight- 
ing infection. 

Few new antibiotics are in development, and 
overuse of existing ones has created resistant 
strains of deadly bacteria. “We need a change 
from what we have,” says 
Stephen Baker, head of 
medicinal chemistry for 
antibacterials at Glaxo- 
SmithKline in Collegeville, 
Pennsylvania. 

Baker will talk about 
some of the alternatives 
to antibiotics on 2 June 
at the American Society 
for Microbiology’s annual 
meeting in New Orleans, 
Louisiana. Here are a few of the therapies that 
scientists are exploring. 


Bdellovibrio bacteriovorus preys on other bacteria.


PREDATORY BACTERIA
Bacteria cause infection, but some can also
fight it by preying on fellow microbes. Several
researchers are beginning to test these predatory
bacteria in animal models and cell cultures.
The best-known species, Bdellovibrio
bacteriovorus, is found in soil. It attacks prey
bacteria by embedding itself between the host’s
inner and outer cell membranes, and begins
to grow filaments and replicate. “It’s like going
into a restaurant, locking the door and starting
to munch away,” says Daniel Kadouri, a bacteriologist
at Rutgers University in Newark,
New Jersey. The host bacterium eventually 
explodes and releases more B. bacteriovorus 
into the environment. 

Kadouri and others are also studying the 
therapeutic potential of the predatory bacterium
Micavibrio aeruginosavorus. And a team
has engineered the gut bacterium Escherichia 
coli to produce peptides that kill Pseudomonas 
aeruginosa, a microbe that causes pneumonia. 

This preliminary research is attracting 
attention. The Pathogen 
Predators programme of 
the US Defense Advanced 
Research Projects Agency, 
which aims to treat soldiers 
who contract infections on 
the battlefield, announced 
nearly US$16 million in 
research grants this week 
to groups studying preda- 
tory bacteria. 


ANTIMICROBIAL PEPTIDES 
Plants, animals and fungi have vastly differ- 
ent immune systems, but all make peptides 
— small proteins — that destroy bacteria. 
Peptides from creatures such as amphibians 
and reptiles, which are unusually resistant to 
infection, could yield new therapeutics. 
Peptides with antibacterial activity have 
been isolated from frogs, alligators and cobras, 
among others, and some seem to be effective
in epithelial cell cultures and at healing
wounds in mice. These peptides can be modified
to increase their potency, and several are
in clinical trials. One, called pexiganan, based
on a peptide from frog skin, is now in phase III
clinical trials to treat diabetic foot ulcers.

Peptides produced by the American alligator could add to the arsenal of antibacterials.

But synthesizing such molecules can be 
expensive, a hurdle that scientists must over- 
come to bring new peptide drugs to market. 


PHAGES 

Of all the alternatives to antibiotics, phages
(viruses that attack bacteria) have been used
the longest in the clinic. Scientists in the Soviet 
Union began developing phage therapies in the 
1920s, and former Soviet countries continue 
the tradition. 

Phages have several advantages over anti- 
biotics. Each type attacks only one type of 
bacterium, so treatments leave harmless (or 
beneficial) bacteria unscathed. And because 
phages are abundant in nature, researchers 
have ready replacements for any therapeutic 
strain that bacteria evolve to resist. 

Mzia Kutateladze, who heads the scientific 
council at the Eliava Institute in Tbilisi, Georgia,
says that antibiotic resistance is driving more 
Western patients to phage-therapy clinics in 
Eastern Europe. The US National Institute of 
Allergy and Infectious Diseases in Bethesda, 
Maryland, now lists phages as a research prior- 
ity for addressing the antibiotic crisis. A clinical 
trial of a phage treatment for infections associ- 
ated with burns is planned by a consortium of 
European centres to start this summer. 


GENE-EDITING ENZYMES 

CRISPR, a gene-editing technique that has 
taken the scientific world by storm, is based 
on a strategy that many bacteria use to 
protect themselves against phages. Researchers
are turning that system back on itself
to make bacteria kill themselves.

Normally, the bacteria detect and destroy 
invaders such as phages by generating a short 
RNA sequence that matches a specific genetic 
sequence in the foreign body. This RNA snip- 
pet guides an enzyme called Cas9 to kill the 
invader by cutting its DNA. 

Scientists are now designing CRISPR 
sequences that target genomes of specific 
bacteria, and some are aiming their CRISPR 
kill switches at the bacterial genes that confer 
antibiotic resistance. 


METALS 

Metals such as copper and silver are the 
oldest antimicrobials. They were favoured by 
Hippocrates in the fourth century BC as a treatment
for wounds, and were used even earlier
by ancient Persian kings to disinfect food and 
water. Only now are researchers beginning to 
understand how metals kill bacteria. 

Some groups are exploring the use of 
metal nanoparticles as antimicrobial treat- 
ments, although little research has been done 
in people. Because metals accumulate in the 
body and can be highly toxic, their use may 
be restricted mostly to topical ointments for 
skin infections. 

An exception is gallium, which is toxic to 
bacteria that mistake it for iron, but is safe 
enough in people to be tested as an intravenous 
treatment for lung infections. This summer, 
researchers at the University of Washington 
in Seattle will begin a phase II clinical trial of 
gallium in 120 patients with cystic fibrosis. 
Pilot studies found that the metal was mod- 
erately successful at breaking down microbial 
biofilms in the lungs and improving patients’ 
breathing.


China accounts for almost one-third of global carbon dioxide emissions.


CLIMATE CHANGE 


Scientists step in to assess 
carbon-emissions pledges


Lack of common standard for national climate plans puts focus on independent ratings. 


BY JEFF TOLLEFSON 


The road to Paris is paved with promises.
In the run-up to the United Nations
climate summit in the French capital
this December, nations are submitting pledges
to limit their future carbon emissions. With a 
preliminary session in Bonn, Germany, sched- 
uled for 1-11 June, it is already clear that the 
commitments are insufficient to meet the UN’s 
goal of limiting future temperature increases to 
2°C above pre-industrial levels. It is difficult to 
say which countries should do more, but in an 
effort to move the process forward, climate sci- 
entists have started to independently assess the 
pledges. They have found that a Paris agreement 
to limit warming to 2 °C would require unreal- 
istic political compromises. 

The big picture is clear: on the basis of 
pledges made so far, the global average temper- 
ature is projected to rise by around 3°C above 
pre-industrial levels by 2100. For a reasonable 
chance of meeting the 2 °C goal, countries 
would have to strengthen their pledges suf- 
ficiently to limit all future greenhouse-gas 
emissions to the equivalent of about 1 trillion 
tonnes of carbon dioxide. 

“Once you say we are not doing enough, it 
begs the question, ‘Who should do more?’,” 
says Joe Aldy, an economist at Harvard
University in Cambridge, Massachusetts, and
former special assistant for energy and envi- 
ronment to US President Barack Obama. 
That is the issue that climate scientists are 
trying to grapple with — without taking sides. 
“This whole issue of assessing these national 
targets is an important one, and it’s definitely 
a place where science can help,” says Niklas 
Höhne, a founding partner of the NewClimate
Institute in Cologne, Germany. “But equity and
fairness is something which is very much up to
interpretation: what’s fair for one is not fair
for another.”
The NewClimate Institute is part of a 
research consortium that produces the Climate 
Action Tracker, a comprehensive assessment
of the pledges made thus far. Using the
peer-reviewed literature as a
source, it rates pledges
on the basis of how close they would bring the 
world to the 2°C goal if all countries adopted 
similar standards. 

The United States, European Union (EU) and
China get a ‘medium’ rating from the Climate
Action Tracker, meaning that their level of commitment,
if adopted globally, would probably
lead to more than 2°C of warming. In a pledge
submitted on 31 March, the United States vowed
to reduce its emissions to at least 26% below
2005 levels by 2025. China has not submitted 
an official pledge, but committed in November 
to start reducing its emissions around 2030, the 
same year that the EU plans to have cut emis- 
sions to 40% below 1990 levels. 

Canada, which pledged on 15 May to reduce 
its emissions to 30% below 2005 levels by 2030, 
received an ‘inadequate’ rating from the tracker, 
meaning that a global application of its plans 
would lead to warming of 3-4°C or more. 


REPORT CARDS 

Criteria are key for assessing these pledges. 
In a hypothetical agreement, warming could
be limited to 2 °C by allocating shares of the 
remaining emissions budget among nations. 
But different methods can be used to allot 
emissions: nations’ proportions would differ 
depending on whether shares were distributed 
on the basis of emissions per capita, per dollar 
of gross domestic product, or according to the 
current distributions (see ‘Cutting the carbon 
cake’). In general, developing countries con- 
tend that those responsible for a greater share 
of historical emissions should get a smaller 
share of the future emissions budget. Rich 
countries counter that most emissions today 
and in the future will come from developing 
countries, which must also do their part. 


SOURCE: M. R. RAUPACH ET AL. NATURE CLIM. CHANGE 4, 873-879 (2014) 


A scientific framework published in Nature 
Climate Change last September created two 
emissions-quota scenarios based on the con- 
trasting positions: one favoured poorer coun- 
tries by allotting emissions on a per capita basis; 
the other preserved the status quo by carrying 
forward countries’ current shares of emissions 
(M. R. Raupach et al. Nature Clim. Change 4, 
873-879; 2014). The study also produced a 
third, ‘compromise’ approach. The authors
found that this scenario worked best to meet 
the 2°C target — but it would require politically 
and economically difficult shifts in energy use 
by all countries. 
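
The arithmetic behind such quota scenarios can be illustrated with a minimal sketch. It assumes a simple linear blend between current emissions shares and population shares, which is an assumption made here for illustration and not necessarily the method used in the study. The function name `allocate`, the `weight` parameter and the two-region input figures are all hypothetical; only the budget of about 1 trillion tonnes comes from the text.

```python
# Illustrative only: the regions and input figures below are made-up
# round numbers, not data from the article or from Raupach et al.

BUDGET_GT = 1000.0  # about 1 trillion tonnes of CO2, as quoted in the text


def allocate(budget, population, emissions, weight):
    """Split `budget` among regions.

    weight = 0.0 carries forward current emissions shares (status quo);
    weight = 1.0 allocates purely by population (per capita);
    values in between give a blended 'compromise' (assumed linear blend).
    """
    total_pop = sum(population.values())
    total_em = sum(emissions.values())
    shares = {}
    for region in population:
        pop_share = population[region] / total_pop
        em_share = emissions[region] / total_em
        blended = (1 - weight) * em_share + weight * pop_share
        shares[region] = budget * blended
    return shares


# Hypothetical two-region world.
population = {"rich": 1.0, "poor": 3.0}   # billions of people
emissions = {"rich": 30.0, "poor": 10.0}  # Gt CO2 per year

status_quo = allocate(BUDGET_GT, population, emissions, weight=0.0)
per_capita = allocate(BUDGET_GT, population, emissions, weight=1.0)
compromise = allocate(BUDGET_GT, population, emissions, weight=0.5)

for name, result in [("status quo", status_quo),
                     ("per capita", per_capita),
                     ("compromise", compromise)]:
    print(name, {region: round(gt) for region, gt in result.items()})
```

In this toy world the richer region's quota falls from three-quarters of the budget under the status quo to one-quarter under a per capita split, which shows why the choice of criterion is so contested.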

A co-author of that study, Glen Peters of the 
Center for International Climate and Environ- 
mental Research in Oslo, has since analysed 
the US, EU and Chinese commitments using 
the same framework. His unpublished analysis 
finds that the US and EU pledges would eat up 
a large proportion of future emissions to meet 
the 2°C goal, leaving developing countries 
with a relatively small share going forward. 

Poorer countries would surely baulk at such 
a plan; they want room to grow and they see 
cheap fossil-fuel energy as a legitimate tool to 
lift their citizens out of poverty. Although it 
has yet to submit its pledge, India, for instance, 
has said that it will not limit its CO₂ emissions
until they have risen to the global average of around
5.1 tonnes per capita, from their current level of
around 1.9 tonnes per capita.

CUTTING THE CARBON CAKE
To keep global warming to within 2 °C above pre-industrial levels, the world must keep future carbon
emissions to about 1 trillion tonnes. Allocating shares of that total to nations on the basis of population size
results in a dramatically different allocation of emissions compared with carrying forward current distributions.
[Figure panel: potential future emissions quotas, if allocated by population size.]


And then there is China. It now accounts
for around 27% of global CO₂ emissions and
its per capita emissions are 6% higher than the
EU’s. Peters’ calculations show that the emissions
reductions required of China would be
so drastic as to be politically unrealistic: even
under the most lenient terms it would have 
to halt its rapidly growing emissions this year 
and cut them almost to zero by 2050. In that 
light, Peters says that climate scientists need to 
focus more on political viability. “The scientific 
community in a sense might be a little bit 
unprepared for doing this kind of thing.”

China and India issued a joint statement 
on 15 May that urged developed countries 
to make bigger emissions cuts before 2020, 
when a Paris agreement would probably take 
effect. The statement also asked developed 
countries to honour their pledge to provide 
US$100 billion per year in aid by 2020 to help 
poor countries to adapt to warming and reduce 
emissions. 


Ultimately, Aldy says, the UN climate
process needs to incorporate a formal review 
similar to those used by other international 
bodies, such as the World Trade Organization 
and the International Monetary Fund. At those 
organizations, experts produce peer-reviewed 
reports on economic and trade policy using 
standardized reports of data and regulations 
submitted by member countries. The scientific 
analyses being done now may represent a small 
step in that direction. 

“It’s going to be a little bit messy this year,” 
Aldy says. “The question is whether that means 
we end up having a bunch of unproductive 
conversations about what countries are doing, 
or whether this turns out to be a productive 
process of experimentation.”


PUBLIC HEALTH 


Ebola R&D woes spur action 


Public-health officials make plans for how to speed up research-and-development 
responses to tropical-disease outbreaks. 


BY ERIKA CHECK HAYDEN 


After failing to deliver new treatments
in time to fight the now-waning Ebola
epidemic in West Africa, public-health
experts are planning for speedier clinical trials 
during future outbreaks. 

“Everyone is delighted that we are going to 
no cases,” says the World Health Organization's
(WHO) assistant director-general Marie-Paule 
Kieny, “but research and development is too late 
by a factor of months. We need to see what can 
be done in order to have the products more rap- 
idly available for another outbreak.” 

In an interview last week at the annual
WHO meeting of member states in Geneva,
Switzerland, Kieny said that the agency is work- 
ing on a framework to identify pathogens that 
merit advanced research. The framework would 
catalogue the work needed to prepare potential 
treatments, vaccines and diagnostics for large-
scale efficacy testing in the event of an outbreak.

The plan, a draft of which could be ready 
by the end of the year, will also spell out how 
researchers, companies and health agencies 
should agree to share data collected in publicly 
funded studies. 

It was only in March, a full year after the
Ebola outbreak was first reported, that large-scale
tests of candidate vaccines got under way.
Some of those trials have little chance of being
completed because cases are now too rare: the 
US National Institute of Allergy and Infectious 
Diseases, which is testing a preventive vaccine, 
estimates that it would need to enrol more 
than 150,000 people in Guinea to show that its 
vaccine works. 

Although trials of Ebola drugs and vaccines 
were set up at a breakneck pace compared with 
the years it normally takes to organize large 
clinical trials, the researchers behind them have 
been frustrated by delays. “We have said that 
we need to conduct trials in this outbreak, and 
we've largely failed, and that’s desperately disappointing,”
says Trudie Lang, a global-health
researcher at the University of Oxford, UK,
who has been involved with clinical trials of 
Ebola treatments. 

Similar concerns arose during the severe 
acute respiratory syndrome outbreak of 
2003-04 and the 2009 H1N1 flu pandemic; after 
both events, researchers drafted study designs 
that could speed up future trials in outbreaks 
caused by respiratory pathogens. But there has 
not been a similar effort to coordinate trials for 
haemorrhagic diseases such as Ebola, which are 
less likely to spread to wealthy nations. 

The WHO framework aims to boost readi- 
ness to conduct trials for diseases that could 
cause global health emergencies. Although it 
has not been decided which diseases will qualify, 
Lassa fever, Rift Valley fever and Middle East 
respiratory syndrome are candidates. 


DIFFERENT PRIORITIES 

The delays in developing trials of Ebola 
treatments, researchers say, were due to a 
disconnect between organizations caring for 
patients and those leading the trials; refusals 
to share data; and failures to perform the effi- 
cacy and safety tests that could have been done 
before the outbreak (see Nature 511, 520; 2014). 

“We need to do the phase I studies in humans 
and have a small stockpile rather than stop at 
the animal stage,” says England’s chief medical
officer Sally Davies. 

Trials also lagged because of disagreements 
over logistics and ethics. For example, after the 
medical aid group Médecins Sans Frontières
(Doctors Without Borders) and researchers at
the University of Oxford organized trials of the 
drug brincidofovir that did not include placebo 
groups, the US Food and Drug Administra- 
tion pushed for a randomized trial that would 
include untreated controls. The trial eventually 
went forward without a control group. 

Data-sharing roadblocks have also occurred 
at all levels. The experimental drug ZMapp was 
given to a handful of patients before supplies ran
out in August 2014. But detailed information on
the patients’ reactions to the drug has not been 
released, owing to fears that this would pre- 
vent researchers from publishing on the cases. 
Similarly, pharmaceuticals firm Chimerix of 
Durham, North Carolina, would not publicly 
reveal why it withdrew support for a trial of 
brincidofovir in late January after four patients 
had been treated, but Kieny says that the deci- 
sion was due to the drug’s lack of efficacy. 

At a broader scale, the WHO cannot publicly
release comprehensive epidemiological infor- 
mation because the data are owned by countries 
where cases occur. 

Kieny is trying to persuade publishers to 
agree that researchers can release analyses 
of patient and laboratory data during health 
emergencies without compromising their abil- 
ity to publish on it. She is also trying to broker 
an agreement with funding agencies that data 
from publicly funded safety and efficacy trials 
in outbreaks must be made widely available.


SYNTHETIC WINDPIPES


Paolo Macchiarini has misrepresented data on the success of artificial-trachea transplants, says a report. 


Surgeon commits 
misconduct 


Papers authored by Paolo Macchiarini misrepresented 
success of pioneering tracheal transplant procedure. 


BY DAVID CYRANOSKI 


A surgeon famed for his pioneering
use of synthetic windpipes
has committed scientific misconduct,
according to an independent investigation.

Six published papers authored by thoracic 
surgeon Paolo Macchiarini, a visiting profes- 
sor at the Karolinska Institute in Stockholm, 
had misrepresented data from recipients of 
the artificial windpipes, or tracheas, reports 
Bengt Gerdin, a general surgeon and pro- 
fessor emeritus at Uppsala University in 
Sweden. The papers made the operation 
sound more successful than it was, says 
Gerdin, who was commissioned by the 
prestigious Karolinska Institute to examine 
Macchiarini’s clinical procedures. 

Gerdin also found that two of the papers
described operations that had not received
the necessary ethical approval, and that a
seventh paper authored by Macchiarini,
reporting transplants of artificial oesophagi 
into rats, had misrepresented results. 

Macchiarini told Nature that he would
not comment on the investigation report,
which is in Swedish, until he had seen an 
English version. In November 2014, he told 
Nature that he welcomed the investigation 
and was confident that “there is nothing sus- 
pect, unethical, inflated or misleading about 
anything I have done or reported”. He has 
two weeks to formally respond to Gerdin’s 
report. 

His research had been hailed as a bright 
spot for regenerative medicine, which has 
been slow to deliver synthetic materials to 
replace natural organs. Macchiarini’s pro- 
cedure involved bathing a polymer trachea 
in stem cells from the transplant recipient’s 
bone marrow. The idea was that when this 
was used to replace a damaged trachea, the 
cells would form the right type of tissue and 
create a seal with the surroundings. 

Macchiarini has put such artificial tracheas 
into eight people. The papers under investi- 
gation relate to just three of the procedures, 
and report that there were some signs that 
the synthetic tracheas had successfully inte- 
grated. Two of those transplant recipients 
have died; one entered intensive care after
the procedure and is still there. Macchiarini 
has previously told Nature that the problems 
faced by the patients were unrelated to the 
transplants. 

The investigation began after four 
Karolinska physicians who were involved in 
the care of those three patients filed com- 
plaints. Karl-Henrik Grinnemo, Matthias 
Corbascio, Thomas Fux and Oscar Simonson 
provided medical records that they alleged 
to be at odds with the published results, and 
called into question the paper on the rat 
model. 

In compiling the report, Gerdin says, he 
tried to avoid matters of interpretation that 
would become “a quarrel between scientists”,
and stuck to facts such as whether the medi- 
cal records showed evidence of a follow-up at 
the intervals claimed. In some instances, pub- 
lications claimed improvement even though 
there was no evidence that the patients had 
been examined. “This is falsification,” says 
Gerdin. Speaking of Macchiarini, he adds: 
“The basic rule in science is to have all reports 
documented, but he doesn’t have them.” 
The rat-model paper included weight-gain 
and computed-tomography (CT) data that 
had been misinterpreted to suggest that the 
oesophageal graft was more successful than it 
was, says Gerdin. 


He concludes that the misrepresentations 
were deliberate: “If there is a mistake once, 
you might think it is random. If it happens 
several times, you begin to question whether 
it really is random.”

The investigation focused on Macchiarini,
but Gerdin notes that Grinnemo, Corbascio
and Simonson were co-authors of one of the
papers¹ included in the investigation involving
a patient, and that Grinnemo was also a
co-author of a second². “I’m not saying they
share the responsibility, but one has to ask what
they knew and what they didn’t know,” says Gerdin.

The three physicians did not respond to 
a request for comment, but Corbascio had 
previously told Nature that he was involved 
only “at a superficial level” with the trans- 
plant recipient. “I had complete confidence 
in Macchiarini,” he said. 

Separately, on 9 April, the Swedish Medical 
Products Agency (MPA) filed a complaint with 
the Swedish state prosecutor over whether 
proper permission was obtained to carry out 
the three synthetic tracheal transplants that 
feature in Gerdin’s investigation. The operations
took place at the Karolinska. Ann Marie
Janson Lang, a clinical assessor in the clinical-trials
department of the MPA, says that the
synthetic tracheas meet the definition of an 
“advanced therapy medicinal product”, which
requires agency permission before it can be 
given to patients, but that no application for a 
permit was made. 

It is not clear who would bear responsibility 
for the breach, if it is confirmed. Macchiarini 
told Nature that, as a visiting professor at the 
Karolinska, “it was never my responsibility to 
obtain any necessary permissions. I was not 
directed to do so, nor did I have the authority 
to do so.” As Nature went to press, a Karolinska
spokesperson said that the institute was pre- 
paring to post on its website “the facts about 
the three operations that Macchiarini per- 
formed and other matters connected to that 
case”. The state prosecutor is expected to report 
back within weeks.


1. Lancet 378, 1997-2004 (2011).
2. Biomaterials 34, 4057-4067 (2013).
3. Nature Commun. 5, 3562 (2014).


CORRECTION 

The map in the News Feature ‘India by the 
numbers’ (Nature 521, 142-143; 2015) 
omitted the province of Meghalaya. The full 
map can be seen at go.nature.com/h2ydqpb. 
Silently, the drone aircraft glides above the arid terrain of New
Mexico — until it suddenly pivots out of control and plummets
to the ground.

Then a mortar round rises from its launcher, arcs high and begins to
descend towards its target — only to flare and explode in mid-flight.

On the desert floor, on top of a big, sand-coloured truck, a cubic
mechanism pivots and fires an invisible infrared beam to zap one target
after another. This High Energy Laser Mobile Demonstrator (HEL MD)
is a prototype laser weapon developed for the US Army by aerospace
giant Boeing of Chicago, Illinois. Inside the truck, Boeing electrophysics
engineer Stephanie Blount stares at the targets on her laptop’s screen
and directs the laser using a handheld game controller. “It has a very
game-like feel,” she says.

That seems only natural: laser weapons are a staple of modern video
games, and ray-guns of various sorts were common in science fiction
for decades before the first real-life laser was demonstrated in 1960. 
But they are not a fantasy anymore. The Boeing prototype is just one of 
several such weapons developed in recent years in both the United States 
and Europe, largely thanks to the advent of relatively cheap, portable and 
robust lasers that generate their beams using optical fibres. 

The output of these fibre weapons is measured in kilowatts (kW), 
orders of magnitude less than the megawatt-class devices once envi- 
sioned for the US Strategic Defense Initiative — an ultimately unsuc- 
cessful cold-war plan that sought to use lasers to disable ballistic missiles 
carrying nuclear warheads. 

But the modern, less ambitious, weapons are on the brink of real- 
world deployment. Tests such as those of the Boeing system show 
that the lasers have enough power to overcome threats from terror 
groups — at a fraction of the price of conventional defences. “It’s a very
cost-effective solution to taking out cheaply made weapons like small
mortars or rockets made out of sewer pipe,” says Blount.

In late 2014, for example, the US Navy showed that a ship-mounted 
laser-weapon system called LaWS could target small boats, such as those 
used by terrorists and pirates. That experimental weapon is currently 
installed on the USS Ponce, an amphibious support ship in the Gulf. 

Many challenges to full-scale deployment remain, warn developers, 
from the need to boost the weapons’ power to the difficulty of operat- 
ing a laser in fog and clouds. But specialists in defence and security are 
starting to take lasers seriously. “After a nearly half-century quest, the 
US military today is on the cusp of finally fielding operationally relevant 
directed-energy weapons,” wrote Paul Scharre, an advanced-technology
specialist at the Washington DC-based Center for a New American
Security (CNAS), in a report on laser weapons released in April¹.


THE POWER PREDICAMENT 
Laser weapons have long fascinated weapons developers — most notably 
during the heyday of the Strategic Defense Initiative, nicknamed Star 
Wars, in the 1980s and 1990s. US spending on laser-weapons research 
peaked in 1989 when, according to the CNAS report¹, the government
spent the equivalent of US$2.4 billion in 2014 dollars. Funding has con- 
tinued at lower levels ever since. Yet the original goal, of being able to 
shoot down incoming ballistic missiles, proved unattainable. 

The trick with any laser weapon is to focus its energy into a spot that 
is small enough to heat up and damage the target — and to do that 
with a machine that is compact and portable enough for the battle- 
field. This is easier said than done. In 1996, for example, the US Air 
Force initiated the Airborne Laser project as one of its contributions to
defence against ballistic missiles. Because it was impossible at the time 
to generate the required megawatts of optical power electrically, the 
developers chose a chemical oxygen iodine laser (COIL) that could be 
fuelled by a chemical reaction. But the COIL was 
so bulky that it could only be carried on a Boeing 
747, and left little space for laser fuel. “It needed 
remote mixing units and chemicals weighing tens 
of thousands of pounds,” says Paul Shattuck, head 
of directed-energy systems for Lockheed Martin Space Systems, which
provided the project’s beam-control technology. 

Another major problem was the atmosphere, says Phillip Sprangle, 
senior scientist for directed-energy physics at the Naval Research 
Laboratory in Washington DC. Not only was the beam scattered by dust 
and natural turbulence, he says, but its passage caused ‘thermal blooming’.
When the beam propagated at very high powers, Sprangle explains,
“the atmosphere absorbed laser light, heating the air and causing the laser 
beam to spread out”. That spreading, in turn, dissipated the laser’s energy. 

The good news for the Airborne Laser project was that this issue, at 
least, had a solution: adaptive optics technology similar to that used by
astronomers to clarify their view of the stars (see Nature 517, 430-432; 
2015). The technology uses mirrors to automatically distort the laser 
beam in a way that cancels the effects of the turbulence, with the same 
result as a pair of glasses correcting for aberrations in the eye. “As the 
laser beam passes through the atmosphere,” says Shattuck, “it cleans up,
and it’s nice and tight when it gets to the target.” 

By 2010, the adaptive optics was good enough for the Airborne Laser 
to destroy a ballistic missile in flight. By then, however, logistical issues 
such as the size problem had led the Department of Defense to lose its 
enthusiasm for energy weapons in general. It cancelled the Airborne 
Laser programme outright by early 2012. At the same time, the depart- 
ment’s spending on high-energy lasers in general was falling; it dropped 
from $961 million in 2007 to $344 million in 2014. 


FIBRES IN THE SPOTLIGHT 

The money did not vanish entirely: attention was already shifting to 
fibre lasers as a way to deliver results more economically. Fibre lasers 
were invented in 1963, and since the 1990s they have been advanced 
almost entirely by IPG Photonics in Oxford, Massachusetts. Whereas 
other solid-state lasers use rigid rods, slabs or discs of crystal to gener- 
ate the beam, and so have to be fairly large, fibre lasers use thin optical 
fibres that can be wrapped into compact coils (see ‘Fibre power’). The 
fibres can collect their optical energy from brighter versions of the cheap 
laser diodes used in DVD players, and then amplify the light to higher 
power, with overall electrical-to-optical conversion efficiencies greater 
than 30%. This is at least double the efficiency typical of other solid-state 
lasers, and close to that of chemical lasers such as COIL. And, being 
intrinsically long and thin, the fibres have a high surface area to volume 
ratio and can radiate away waste heat very quickly — an ability that helps 
to give the lasers a long working life and low maintenance requirements.

These advantages first attracted attention during the 1990s, when fibre 
lasers began to be used to beef up optical signals carrying Internet data 
through undersea cables. But since the early 2000s, IPG has focused on 
developing kilowatt-class industrial lasers for welding, drilling and cut- 
ting — devices that also attracted the attention of military researchers. 

Around 2010, recalls Shattuck, he and his colleagues at Lockheed 
Martin heard from Israeli civilians targeted by rockets launched from 
the Gaza Strip. “The mayor of a village stood up and said, ‘Please, give
me some kind of defence’,” Shattuck says. This inspired Lockheed
Martin to develop the Area Defense Anti-Munitions (ADAM) system, 
which uses an off-the-shelf 10-kW laser from IPG to keep costs down. 
Since 2012, the company has shown that ADAM can disable targets 
such as boats, drones and simulated small-calibre rockets from about 
1.5 kilometres away. Although unwilling to disclose the price of ADAM 
— or whether anybody has bought one — Lockheed Martin says that it 
is now ready to provide the system to customers. 

Blount is less reticent about Boeing's HEL MD prototype, which also 
uses a commercial 10-kW fibre laser. With the system drawing its power
from the vehicle engine or a separate generator, she says, “it takes less
than two cups of fuel to fire the laser for long enough to disable many
targets.” This makes it much cheaper to use for defence than conventional
missiles. “An inexpensive missile is $100,000 and that’s one shot,”
says David DeYoung, Boeing’s director of directed-energy systems. “To 
shoot a laser-weapon system once is less than $10.” 
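
The comparison DeYoung draws can be written out as simple arithmetic. The sketch below is illustrative only: it uses the two per-shot figures quoted above ($100,000 for an inexpensive missile, and $10 as the upper bound for one laser shot) and ignores everything the article does not quantify, such as the purchase price of the laser system, maintenance and crew. The function names are hypothetical.

```python
# Per-shot figures quoted in the article; all other costs are ignored.
MISSILE_COST_USD = 100_000  # "an inexpensive missile", good for one shot
LASER_SHOT_COST_USD = 10    # upper bound: "less than $10" per laser shot


def engagement_cost(shots: int, cost_per_shot: float) -> float:
    """Total cost of firing `shots` times at a fixed cost per shot."""
    if shots < 0:
        raise ValueError("shots must be non-negative")
    return shots * cost_per_shot


def shots_per_missile(missile_cost: float = MISSILE_COST_USD,
                      laser_cost: float = LASER_SHOT_COST_USD) -> float:
    """Number of laser shots that cost the same as one missile."""
    return missile_cost / laser_cost


if __name__ == "__main__":
    print("Laser shots per missile:", shots_per_missile())
    print("50 missile shots (USD):", engagement_cost(50, MISSILE_COST_USD))
    print("50 laser shots (USD):", engagement_cost(50, LASER_SHOT_COST_USD))
```

On these quoted figures alone, one missile costs as much as at least 10,000 laser shots; the true ratio depends on costs that are not given here.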

Blount stresses that the resurgence of laser weapons owes at least as 
much to advanced image-recognition and targeting systems as to the laser
itself. “The better the pointing and tracking system,” she says, “the better
able you are to put the beam on the most vulnerable point of a target.”

Thanks to computerized aiming, HEL MD can operate in wholly 
autonomous mode, which Boeing tested successfully in May 2014 — 
although the trials uncovered an unexpected challenge. The weapon's 
laser beam is silent and invisible, and not all targets explode as they are 
destroyed, so an automated battle can be over before operators have 
noticed anything. “The engagements happen quickly, and unless you're 
staring at a screen 24-7 you'll never see them,” Blount says. “So we've 
built sound in for whenever we fire the laser. We plan on taking advan- 
tage of lots of Star Trek and Star Wars sound bites.” 


STRENGTH IN NUMBERS 

Aiming and targeting may be battle-ready, but power is still a problem. 
A commercial laser’s 10-kW output is at the low end of what is useful 
for laser weapons. And using fibres puts limits on the beam’s power and 
quality — not least because at high powers, the cascade of photons surg- 
ing through the fibre can heat it up faster than it can radiate the energy, 
and can thus cause damage. To avoid this, researchers are working to 
combine the output from several lasers. 

The ideal way to do this would be ‘coherent combining’ in which the 
waves from each laser march together in tightly synchronized formation. 
This technique is widely used in radio and microwave applications, says 
Tso Yee Fan, a laser scientist at the Massachusetts Institute of Technolo- 
gy’s defence-oriented Lincoln Laboratory in Lexington. But coherence is 
much tougher to achieve with visible and infrared light. The waves from 
each laser must have almost identical wavelengths, the planes of their 
oscillations must precisely align, and the peaks and troughs of each wave 
must coincide. “In radio-frequency or microwaves, the wavelength’s a few 
centimetres,” Fan says. “In optics, the wavelength’s around a micrometre, 
so being able to do those kinds of controls has been really difficult.”

But that may not matter much, says Sprangle. In 2006, he and his team 
reported computer simulations suggesting that an ‘incoherent combina- 
tion’ of several fibre-laser beams hitting a single spot would be almost as 
effective as a coherent combination². With either approach, he says, “when
you’re propagating over long ranges through atmospheric turbulence,
you get approximately the same power on the target”. In 2009, his group
confirmed this theory by using mirrors to combine 4 fibre-laser beams
into a 5-centimetre spot on a target more than 3 kilometres away³.

Building on Sprangle’s work, the US Office of Naval Research has devel- 
oped the 30-kW LaWS, which incoherently combines six commercial 
fibre-lasers. LaWS has been installed on the USS Ponce since September 
2014, and has been tested on objects such as small boats and drones. 
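
As a rough illustration of why incoherent combining is attractive, the powers of the individual beams simply add when they overlap on one spot. The sketch below assumes six equal 5-kW beams (the article gives only the 30-kW total and the number of lasers) and, purely for scale, borrows the 5-centimetre spot size from Sprangle's separate four-beam experiment. It ignores atmospheric losses and beam shape, so the irradiance figure is an idealized average, not a measured value.

```python
import math


def combined_power_kw(beam_powers_kw):
    """Incoherent combining: beam powers add, with no phase matching needed."""
    return sum(beam_powers_kw)


def mean_irradiance_kw_per_cm2(total_power_kw, spot_diameter_cm):
    """Average power per unit area over a circular spot (idealized, lossless)."""
    if spot_diameter_cm <= 0:
        raise ValueError("spot diameter must be positive")
    area_cm2 = math.pi * (spot_diameter_cm / 2) ** 2
    return total_power_kw / area_cm2


if __name__ == "__main__":
    beams = [5.0] * 6  # assumption: six identical 5-kW fibre lasers
    total = combined_power_kw(beams)
    print(f"Combined power: {total:.0f} kW")
    # Assumption: all of that power lands in a 5-cm-diameter spot.
    print(f"Mean irradiance: {mean_irradiance_kw_per_cm2(total, 5.0):.2f} kW/cm^2")
```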

The missile-specialist MBDA Germany in Schrobenhausen has developed
a similar approach⁴. In October 2012, the firm successfully used
its 40-kW combined fibre-beam system to destroy model artillery shells 
towed through the air some 2 kilometres away. MBDA’s tests have also
helped to debunk the science-fiction idea that reflective armour would 
defend against laser weapons. They found that any dust on the mirrored 
surface would get burned in, and lead to the destruction of the target 
even faster than with a non-reflective surface. 

Markus Martinstetter from MBDA’s Future Systems Directorate 
argues that high-precision targeting minimizes the chances of acciden- 
tally hurting bystanders while trying to shoot down targets, especially 
compared with conventional explosives. “There is no risk from frag- 
menting ammunition and we only start the irradiation when the aim 
point is exactly on target,” he says. 

Lockheed Martin is also working on laser weapons that can take 
on targets that are more complex or farther away than can be tackled 
by its low-cost ADAM system. In March, for example, the company 
reported that its Advanced Test High Energy Asset (ATHENA) sys- 
tem could disable the running engine of a small truck mounted on a 
test platform. ATHENA uses a similar adaptive-optics system to the 
Airborne Laser, coupled with Lockheed’s Accelerated Laser Demon- 
stration Initiative (ALADIN) fibre-laser system. 
The High Energy Laser Mobile Demonstrator can take down drones.


ALADIN combines the output of several fibre lasers, each with a 
slightly different wavelength, into a single 30-kW beam. This ‘wavelength
beam combining’ approach originated at the Lincoln Laboratory⁵
and is similar to methods that channel Internet traffic into fibre-optic 
cables. Fan notes that this method is easier than coherent combining, 
but gives better-quality beams than incoherent combining, so it can 
more easily hit smaller targets from longer distances. 

Jason Ellis, a visiting fellow at the CNAS and lead author of the think 
tank’s laser-weapons report¹, says that such developments convince
him that fibre-laser weapons are coming of age — and that emerging 
advances could take them to hundreds of kilowatts and extend their 
range to hundreds of kilometres. 

Despite such advances, a February 2014 poll⁶ of US national-security
specialists found that just one-fifth believed that directed-energy 
weapon technologies would be mature within a decade. 

Michael Carter, a programme manager for photon science at 
Lawrence Livermore National Laboratory in Livermore, California, cau- 
tions that today’s lasers are a very long way from their science-fiction 
counterparts. “They’re not yet the Star Trek phaser,” he says. “People
talk about speed-of-light engagement, but it still takes time to demolish
targets. At the most basic level, if you can’t see it — if there’s too much
rain or fog — your laser can’t hit it.” He suggests that the greatest value
of the current generation of demonstration systems may be in working
out how to handle such broader challenges before better lasers emerge.
“Don’t mistake what they’re doing on the USS Ponce for a new strategic
superiority,” Carter warns. “It may be the first step in that direction but
it’s not going to change the game by itself.”

Even the weapons companies are cautious not to overstate their case. 
For example, MBDA expects that it will take 3-5 years for truly opera- 
tional systems to appear even in the tens of kilowatts range. And in 
some circumstances — such as a foggy day — conventional weapons 
will always be more effective. “You give the defender of the future both, 
and put the choice in their hands,” DeYoung recommends.

Despite their modest capabilities, Scharre claims that fibre-laser 
weapons could find a niche in US military defence in 5-10 years. “They 
may not be as grand and strategic as the Star Wars concept,” he says, “but
they could save lives, protect US bases, ships and service members.”


Andy Extance is a freelance writer in Exeter, UK. 
Clever fish


By revealing that fish cooperate, cheat and punish, 
Redouan Bshary has challenged ideas about brain evolution. 


BY ALISON ABBOTT 


Redouan Bshary well remembers the moment he realized that
fish were smarter than they are given credit for. It was 1998, and
Bshary was a young behavioural ecologist with a dream project:
snorkelling in Egypt's Red Sea to observe the behaviour of coral-reef
fish. That day, he was watching a grumpy-looking grouper fish as it
approached a giant moray eel.

As two of the region's top predators, groupers and morays might be
expected to compete for their food and even avoid each other — but
Bshary saw them team up to hunt. First, the grouper signalled to the eel
with its head, and then the two swam side by side, with the eel dipping
into crevices, flushing out fish beyond the grouper’s reach and getting
a chance to feed alongside. Bshary was astonished by the unexpected
cooperation; if he hadn't had a snorkel in his mouth, he would
have gasped.

Redouan Bshary studies social behaviour of coral-reef fish.

This underwater observation was the first in a series of surprising 
discoveries that Bshary has gone on to make about the social behaviour 
of fish. Not only can they signal to each other and cooperate across spe- 
cies, but they can also cheat, deceive, console or punish one another — 
even show concern about their personal reputations. “I have always 
had a lot of respect for fish,” says Bshary. “But one after the other, these 
behaviours took me by surprise.” 

His investigations have led him to take a crash course in scuba div- 
ing, go beach camping in Egypt and build fake coral reefs in Australia. 
The work has also destroyed the stereotypical idea that fish are dumb 
creatures, capable of only the simplest behav- 
iours — and it has presented a challenge to 
behavioural ecologists in a different field. 
Scientists who study primates have claimed 
that human-like behaviours such as coopera- 
tion are the sole privilege of animals such as 
monkeys and apes, and that they helped to 
drive the evolution of primates’ large brains. 
Bshary — quiet, but afraid of neither adven- 
ture nor of contesting others’ ideas — has given 
those scientists reason to think again. 

“Redouan has thrown down the gauntlet to 
us primatologists,” says Carel van Schaik, an 
expert in orang-utan culture at the University of Zurich in Switzerland. 
“He has made us realize that some of the explanations of primate intel- 
ligence that we have cherished don’t hold water anymore.” 


Stream fishing 

Bshary says that he was “pre-imprinted to like fish”. As a child in Starnberg,
Germany, he played constantly in the stream at the edge of the family
garden, building dams and pools and trapping fish. Passionate about
animal behaviour, he studied evolutionary ecology at the University of 
Munich, and then did a PhD at the Max Planck Institute for Behavioural 
Physiology in Starnberg. But for his field work, he journeyed to the Côte
d’Ivoire, where he followed tree-living monkeys and discovered that
different species collaborate to reduce predator risk. 

His PhD supervisor, Ronald Noé, thought it would be “near impos- 
sible” to stalk monkeys that leap from tree-top to tree-top, but Bshary 
seemed to have a flair for it. On occasion, he even camouflaged himself 
under a leopard skin to imitate one of their predators. And he became 
fascinated by one question: what makes animals cooperate when stand- 
ard natural selection would predict selfish behaviour to be the norm? 

Noé, a primate behavioural ecologist now at the Hubert Curien 
Multidisciplinary Institute in Strasbourg, France, had come up with a 
biological market-based theory of cooperation. It proposed that animals 
cooperate to trade a specific commodity — such as food — for a service
that would promote their survival, such as protection from a predator¹.
“An attractive theory — but there were no strong data to support it,”
says Bshary.

He looked around for a system where market forces might be operat- 
ing. And he found one when Hans Fricke, a fish ecologist working at 
the Max Planck institute, told him the strange tale of barrier-reef fish 
that operate a remarkable system of cooperation. ‘Cleaner’ fish, such as 
the brightly striped wrasse, will nibble parasites off the skin of ‘client’ 
fish in small coral territories known as cleaning stations. Bshary real- 
ized that this provided a perfect situation in which to test the market 
theory because client fish seemed to be trading food — in the form of 
parasites — for a skin-cleaning service. He decided to follow his hunch 
and study the coral-reef fish. 

There was one small problem: Bshary had never been scuba diving. 
He took his first lessons during a snowy winter in Lake Starnberg, then 
set off for the Red Sea, setting up camp in Ras Mohammed National Park 
in Egypt. Together with a few students, Bshary spent two full months 
a year camped on a scorching beach, sleeping under the stars, eating a
diet of fruit and vegetables and doing four exhausting, 75-minute dives
a day. “In the mornings he would wake up and immediately put on his
wetsuit and jump straight into the sea,” recalls former student Erica van
de Waal, now a research fellow at the University of Zurich. Armed with
a plastic underwater writing slate, a pencil and a stopwatch, he shad- 
owed client fish, observing their interactions with the wrasse cleaner 
fish — and soon collected evidence of a well-functioning market. “For 
me this system was a gold mine,” Bshary says, and he mined a lot of gold.

He discovered, for example, that fish did not just trade parasites for 
skin cleaning; the cleaner fish also cheated on the deal. Rather than 
eating parasites, they actually preferred the nutritious protective mucus 
that covers fish skin, and were constantly tempted to take a quick, illicit 
bite of it. Bshary could count how often this 
happened — and therefore whether the cli- 
ents were getting a good or a bad cleaning 
service — because the clients gave a jolt when 
they were bitten. 

The market theory predicted that if there 
were lots of clients around, the cleaners 
would enjoy a seller’s market and would risk 
taking more bites of mucus. This is just like 
a mechanic getting away with shoddy car 
services when there are no competing busi- 
nesses in town. Bshary found this to be true, 
and he also found that the buyers could pro- 
test. Because some client fish roam large territories, they could choose 
to boycott any cleaning stations that deliver a bad service — just as 
someone who received a poor car service might travel farther to find 
a better garage².

While racking up evidence for the market theory, Bshary also 
observed a range of other social behaviours that had never been seen 
before in fish. He saw that unsatisfied clients sometimes punish cheat- 
ing cleaners by chasing them around, and that this punishment makes 
these fish less likely to cheat³. He saw cleaners ingratiating themselves
with certain clients: they gave preference to visiting fish such as group- 
ers, rather than the smaller, local fish that did not have the option of 
going elsewhere. He found that the cleaners cheated less when they were 
being watched by other potential clients — a sign that they were buffing 
their reputations⁴. And he saw reconciliation: if cleaners behaved badly,
they then massaged the backs of offended clients with their pelvic fins⁵.

It was all adding up to a catalogue of behaviours worthy of Niccolò
Machiavelli’s The Prince — but it was based on observation alone. 
Bshary needed to move to an experimental set-up where he could test 
how the fish behaved. And so in 2003, he began experiments at Liz- 
ard Island Research Station on Australia’s Great Barrier Reef. He was 
employed, however, on the other side of the world: first at the University 
of Liverpool, UK, and now at the University of Neuchâtel, Switzerland.
“It was not difficult to sign up to a lifetime of fieldwork at warm coral
beach locations,” he admits.

Over the next few years Bshary would capture fish in the wild reefs, 
house them in tanks for the duration of his experiments, then release 
them. He simulated the choice that cleaners make between parasites 
and mucus by building moveable plastic plates smeared with prawn, 
which the fish love, and fish flakes, which they enjoy less. In this set-up, 
the plates may be snatched away if the cleaners go for the prawns — just 
like a client fish may swim away if its mucus gets bitten too often. So the 
cleaners learned to cooperate and eat fish flakes instead. 

Such experiments take patience: some fish take a month just to adjust 
to the tanks. But in this way, Bshary proved that all the behaviours he 
had observed in the wild could be repeated under experimental con- 
ditions. And he discovered even more bizarre facts about the social 
lives of fish. In one experiment, he showed that when cleaners work in 
male and female pairs, as frequently happens in the wild, they are much 
less likely to cheat than when they work alone⁶; and that this is mostly
because the female gets punished by being chased around by the male
if she slacks off⁷.
A fake moray, made from a laminated photo, fools a grouper fish.


Perhaps the laboratory’s most imaginative experiment involved the 
construction of an entire fake coral reef, complete with dummy eels. The
job fell to doctoral student Alex Vail, who glued together bits of coral 
rubble and put them in a hot-tub-sized tank. Vail then made models of 
moray eels by printing, gluing together and laminating two life-sized 
photographs, and attaching nylon strings that allowed him to pull the 
fake eels out of the fake coral, like a puppet. (Vail subsequently went on 
to a successful career in underwater filming.) Using this set-up, the team 
explored the behaviour that so shocked Bshary when he observed it in 
1998: a grouper and eel teaming up to flush out fish to eat. They showed 
that the grouper quickly learnt to signal — by turning and shaking its 
head — only to those moray eels that responded by moving towards, 
rather than away from, the fake reef. 

Bshary amassed ample evidence that fish engage in a range of social 
behaviours, and he assumed that all of them resulted from simple evolu- 
tion at work. Natural selection favoured fish that could learn, by simple 
association, which choices allowed them to efficiently rid themselves 
of parasites or access food. 

By 2010, Bshary’s thoughts were turning back to the world of prima- 
tology, in which he had been immersed during his PhD. He knew that 
he had observed in fish many of the behaviours that primatologists 
had shown in monkeys and apes. But primatologists had made grander 
claims for their observations. The ‘social brain’ theory argues that pri- 
mates evolved brains that are large for their body size to manage their 
unusually complex social systems. Only primate brains, the theory says, 
have the depth of cognitive analysis necessary to cooperate, deceive and 
solve other problems in a social world. 

Bshary disagreed. Maybe, he thought, these particular social behaviours 
in primates were also learnt by simple association and did not require the 
extra computing power of their big brains. And his findings meshed with 
those emerging from studies on the social behaviours of other animals, 
ranging from elephants to birds. “I think primatologists tend to make 
big claims because they look up the evolutionary chain and compare the 
primates’ behaviours to humans, instead of looking down the evolutionary
chain to see if the phenomena also existed in lower species,” he says.

At the time, primatologists were certainly not looking at fish. But that 
changed when Bshary teamed up with primatologist Sarah Brosnan at 
Georgia State University in Atlanta to directly pit the skills of cleaner fish 
against capuchin monkeys, chimpanzees and orang-utans in a foraging 
test. Each animal was presented with food on two differently coloured 
plates, one of which was a permanent fixture in their tanks or pens, 
whereas the other was temporary. The challenge was to learn to eat
from the temporary plate first, before it disappeared — and the scientists
counted how many trials it took for the animal to figure this out. 

The cleaner fish solved the problem first; they have evolved in their
ecological niche to preferentially feast from visiting clients before they 
disappear. For fun, Bshary set up an equivalent ‘foraging’ test for his 
four-year-old daughter, complete with temporary and permanent plates, 
each bearing one chocolate M&M. In a series of 100 different trials, she
never learnt to eat from the temporary plate. 

The fish, meanwhile, were already aceing a more advanced test. When 
Bshary and Brosnan switched the coloured plates so that the permanent 
one suddenly became temporary and vice versa, the fish again understood 
the switch faster than the apes did (and equally as fast as the capuchins).
This is known as reversal learning — and when the primatologists read 
that result, they took note. “Reversal learning has often been touted as the 
gold standard of general cognitive abilities,” says van Schaik — a sophis- 
ticated skill that correlates with brain size. “Since small-brained fish do it 
quite well, maybe we'll have to abandon this idea.”

“The ball is in our court,” says evolutionary psychologist Robin Dunbar
of the University of Oxford, UK, who developed the social brain theory. 
Dunbar now accepts that the evolution of large brains was not driven by 
the need to carry out single ‘smart’ behaviours such as cooperation or 
deception. But that doesn’t mean the social brain theory has to be aban- 
doned, he says — just refined. He and other primatologists now propose 
that primates evolved bigger brains because they needed an all-round 
high level of general intelligence to survive the pressures of living in tight 
social groups — for example, to recognize large numbers of individuals 
and remember their complicated genetic and hierarchical relationships. 

Fish, which tend to have one-on-one interactions and live in loose 
schools, do not need to multi-task in quite the same way, Dunbar says. 
“It may boil down to the speed of cognitive processing and accuracy of
judgement,” he suggests. 


Intelligence tests 

Michael Tomasello, an evolutionary psychologist at the Max Planck 
Institute for Evolutionary Anthropology in Leipzig, Germany, bounces 
the ball right back to Bshary, challenging him to show how smart fish 
really are. “Perhaps the most pressing question is how flexible and gen- 
eral fish cognition is,” he says — something Bshary is already testing by 
designing further fish intelligence tests. 

The mysteries of the fish brain deepened in 2009, when Bshary’s team 
chanced across a habitat in the reefs around Lizard Island that had rela- 
tively few fish and therefore less competition and social complexity. 
To Bshary’s surprise, the cleaner fish there turned out to be much less 
socially smart than cleaner fish just 20 metres away. But their skill level
may be optimal for their environment — another hypothesis that he 
now plans to explore. 

Whatever the next instalment brings, colleagues say that Bshary has 
already shifted a view of animal cognition in which humans and their 
primate cousins tower over everything else. “Primate chauvinism may 
now be poised to decline, thanks in large part to Bshary’s fish work,” 
says primatologist and ethologist Frans de Waal of Emory University in 
Atlanta, Georgia. “They now really do have to take on board that most 
species are going to have a type of smart intelligence.”


Alison Abbott is Nature’s senior European correspondent.

BAE Systems’ Taranis drone has autonomous elements, but relies on humans for combat decisions.


Ethics of artificial 
intelligence 


Four leading researchers share their concerns 
and solutions for reducing societal risks from 
intelligent machines. 


STUART RUSSELL
Take a stand on
AI weapons 


Professor of computer science, 
University of California, Berkeley 


The artificial intelligence (AI) and robotics 
communities face an important ethical 
decision: whether to support or oppose the 
development of lethal autonomous weapons 
systems (LAWS). 

Technologies have reached a point at 
which the deployment of such systems is — 
practically if not legally — feasible within 
years, not decades. The stakes are high: 
LAWS have been described as the third 
revolution in warfare, after gunpowder and 
nuclear arms. 

Autonomous weapons systems select and 
engage targets without human interven- 
tion; they become lethal when those targets 
include humans. LAWS might include, for 
example, armed quadcopters that can search 
for and eliminate enemy combatants in a 
city, but do not include cruise missiles or 
remotely piloted drones for which humans 
make all targeting decisions. 

Existing AI and robotics components 
can provide physical platforms, perception, 
motor control, navigation, mapping, tactical 
decision-making and long-term planning. 
They just need to be combined. For exam- 
ple, the technology already demonstrated for 
self-driving cars, together with the human- 
like tactical control learned by DeepMind’s 
DQN system, could support urban search- 
and-destroy missions. 

Two US Defense Advanced Research 
Projects Agency (DARPA) programmes 
foreshadow planned uses of LAWS: Fast 
Lightweight Autonomy (FLA) and Collabo- 
rative Operations in Denied Environment 
(CODE). The FLA project will program tiny 
rotorcraft to manoeuvre unaided at high 
speed in urban areas and inside buildings. 
CODE aims to develop teams of autono- 
mous aerial vehicles carrying out “all steps 
of a strike mission — find, fix, track, target, 
engage, assess” in situations in which enemy 
signal-jamming makes communication with 
a human commander impossible. Other
countries may be pursuing clandestine
programmes with similar goals. 

International humanitarian law — which 
governs attacks on humans in times of war 
— has no specific provisions for such auton- 
omy, but may still be applicable. The 1949 
Geneva Convention on humane conduct 
in war requires any attack to satisfy three 
criteria: military necessity; discrimination 
between combatants and non-combatants; 
and proportionality between the value of the 
military objective and the potential for col- 
lateral damage. (Also relevant is the Martens 
Clause, added in 1977, which bans weapons 
that violate the “principles of humanity and 
the dictates of public conscience”.) These are
subjective judgments that are difficult or 
impossible for current AI systems to satisfy. 

The United Nations has held a series of 
meetings on LAWS under the auspices of 
the Convention on Certain Conventional 
Weapons (CCW) in Geneva, Switzerland. 
Within a few years, the process could result 
in an international treaty limiting or ban- 
ning autonomous weapons, as happened 
with blinding laser weapons in 1995; or it 
could leave in place the status quo, leading 
inevitably to an arms race. 

As an AI specialist, I was asked to provide
expert testimony for the third major meet- 
ing under the CCW, held in April, and heard 
the statements made by nations and non- 
governmental organizations. Several coun- 
tries pressed for an immediate ban. Germany 
said that it “will not accept that the decision 
over life and death is taken solely by an autonomous
system”; Japan stated that it “has no
plan to develop robots with humans out of 
the loop, which may be capable of committing 
murder” (see go.nature.com/fwricl). 

The United States, the United Kingdom 
and Israel — the three countries leading the 
development of LAWS technology — sug- 
gested that a treaty is unnecessary because 
they already have internal weapons review 
processes that ensure compliance with inter- 
national law. 

Almost all states who are party to the 
CCW agree with the need for ‘meaningful
human control’ over the targeting and
engagement decisions made by robotic
weapons. Unfortunately, the meaning of
‘meaningful’ is still to be determined.

The debate has many facets. Some argue 
that the superior effectiveness and selectiv- 
ity of autonomous weapons can minimize 
civilian casualties by targeting only com- 
batants. Others insist that LAWS will lower 
the threshold for going to war by making it 
possible to attack an enemy while incurring 
no immediate risk; or that they will enable 
terrorists and non-state-aligned combatants 
to inflict catastrophic damage on civilian 
populations. 

LAWS could violate fundamental principles
of human dignity by allowing machines
to choose whom to kill — for example, they
might be tasked to eliminate anyone exhibiting
‘threatening behaviour’. The potential
for LAWS technologies to bleed over into
peacetime policing functions is evident
to human-rights organizations and drone
manufacturers.

NASA's Robonaut 2 could be used in medicine and industry as well as space-station construction.

In my view, the overriding concern 
should be the probable endpoint of this 
technological trajectory. The capabilities of 
autonomous weapons will be limited more 
by the laws of physics — for example, by 
constraints on range, speed and payload — 
than by any deficiencies in the AI systems 
that control them. For instance, as flying 
robots become smaller, their manoeuvrabil- 
ity increases and their ability to be targeted 
decreases. They have a shorter range, yet 
they must be large enough to carry a lethal 
payload — perhaps a one-gram shaped 
charge to puncture the human cranium. 
Despite the limits imposed by physics, one 
can expect platforms deployed in the mil- 
lions, the agility and lethality of which will 
leave humans utterly defenceless. This is not 
a desirable future. 

The AI and robotics science communities,
represented by their professional societies,
are obliged to take a position, just as physi- 
cists have done on the use of nuclear weap- 
ons, chemists on the use of chemical agents 
and biologists on the use of disease agents 
in warfare. Debates should be organized at 
scientific meetings; arguments studied by 
ethics committees; position papers written 
for society publications; and votes taken by 
society members. Doing nothing is a vote 
in favour of continued development and 
deployment. 


SABINE HAUERT 
Shape the debate, 
don’t shy from it 


Lecturer in robotics, University of
Bristol 


Irked by hyped headlines that foster fear 
or overinflate expectations of robotics and 
artificial intelligence (AI), some researchers
have stopped communicating with the
media or the public altogether.

But we must not disengage. The public 
includes taxpayers, policy-makers, investors 
and those who could benefit from the tech- 
nology. They hear a mostly one-sided dis- 
cussion that leaves them worried that robots 
will take their jobs, fearful that AI poses an 
existential threat, and wondering whether 
laws should be passed to keep hypothetical 
technology ‘under control’. My colleagues
and I spend dinner parties explaining that 
we are not evil but instead have been work- 
ing for years to develop systems that could 
help the elderly, improve health care, make 
jobs safer and more efficient, and allow us to 
explore space or beneath the oceans. 

Experts need to become the messengers. 
Through social media, researchers have a 
public platform that they should use to drive 
a balanced discussion. We can talk about the
latest developments and limitations, provide 
the big picture and demystify the technology. 
I have used social media to crowd-source 
designs for swarming nanobots to treat 
cancer. And I found my first PhD student 
through his nanomedicine blog. 

The AI and robotics communities need
thought leaders who can engage with
prominent commentators, such as physi- 
cist Stephen Hawking and entrepreneur- 
inventor Elon Musk, and set the agenda at 
international meetings such as the World 
Economic Forum in Davos, Switzerland. 
Public engagement also drives fund- 
ing. Crowdfunding for JIBO, a personal 
robot for the home developed by Cynthia 
Breazeal, at the Massachusetts Institute of 
Technology (MIT) in Cambridge, raised 
more than US$2.2 million. 

There are hurdles. First, many researchers
have never tweeted, blogged or made
a YouTube video. Second, outreach is ‘yet
another thing to do’, and time is limited.
Third, it can take years to build a social-media
following that makes the effort worthwhile.
And fourth, engagement work is rarely valued
in research assessments, or regarded
seriously by tenure committees.

Training, support and incentives 
are needed. All three are provided by 
Robohub.org, of which I am co-founder 
and president. Launched in 2012, Robohub 
is dedicated to connecting the robotics 
community to the public. We provide crash 
courses in science communication at major 
AI and robotics conferences on how to use
social media efficiently and effectively. 
We invite professional science commu- 
nicators and journalists to help research- 
ers to prepare an article about their work. 
The communicators explain how to shape 
messages to make them clear and con- 
cise and avoid pitfalls, but we make sure 
the researcher drives the story and con- 
trols the end result. We also bring video 
cameras and ask researchers who are pre- 
senting at conferences to pitch their work 
to the public in five minutes. The results 
are uploaded to YouTube. We have built a 
portal for disseminating blogs and tweets, 
amplifying their reach to tens of thousands 
of followers. 

I can list all the benefits of science 
communication, but the incentive must 
come from funding agencies and insti- 
tutes. Citations cannot be the only meas- 
ure of success for grants and academic 
progression; we must also value shares, 
views, comments and likes. MIT robotics
researcher Rodney Brooks’s classic 1986
paper on the ‘subsumption architecture’, a
bio-inspired way to program robots to
react to their environment, gathered
nearly 10,000 citations in three
decades (R. Brooks IEEE J. Robot. Automat.
2, 14–23; 1986). A video of Sawyer, a robot
developed by Brooks’s company Rethink 
Robotics, received more than 60,000 views 
in one month (see go.nature.com/jqwfmz). 
Which has had more impact on today’s 
public discourse? 

Governments, research institutes, 
business-development agencies, and 
research and industry associations do 
welcome and fund outreach and science- 
communication efforts. But each project 
develops its own strategy, resulting in pock- 
ets of communication that have little reach. 

In my view, AI and robotics stakeholders
worldwide should pool a small portion of 
their budgets (say 0.1%) to bring together 
these disjointed communications and 
enable the field to speak more loudly. 
Special-interest groups, such as the Small 
Unmanned Aerial Vehicles Coalition that 
is promoting a US market for commercial 
drones, are pushing the interests of major 
corporations to regulators. There are few 
concerted efforts to promote robotics and 
AI research in the public sphere. This 
balance is badly needed. 

A common communications strategy 
will empower a new generation of roboti- 
cists that is deeply connected to the public 
and able to hold its own in discussions. This 
is essential if we are to counter media hype 
and prevent misconceptions from driving 
perception, policy and funding decisions. 


RUSS ALTMAN 
Distribute AI 
benefits fairly 


Professor of bioengineering, genetics, 
medicine and computer science, 
Stanford University 


Artificial intelligence (AI) has astounding 
potential to accelerate scientific discovery 
in biology and medicine, and to transform 
health care. AI systems promise to help 
make sense of several new types of data: 
measurements from the ‘omics’ such as 
genomics, proteomics and metabolomics; 
electronic health records; and digital- 
sensor monitoring of health signs. 

Clustering analyses can define new 
syndromes — separating diseases that 
were thought to be the same and unify- 
ing others that have the same underlying 
defects. Pattern-recognition technologies 
may match disease states to optimal treat- 
ments. For example, my colleagues and I 
are identifying groups of patients who are 
likely to respond to drugs that regulate the 
immune system on the basis of clinical and 
transcriptomic features.

Kirobo, Japan’s first robot astronaut, was deployed to the International Space Station in 2013.


In consultations, physicians might be 
able to display data from a ‘virtual cohort’ 
of patients who are similar to the one 
sitting next to them and use it to weigh 
up diagnoses, treatment options and the 
statistics of outcomes. They could make 
medical decisions interactively with such 
a system or use simulations to predict out- 
comes on the basis of the patient’s data and 
that of the virtual cohort. 

I have two concerns. First, AI technolo- 
gies could exacerbate existing health-care 
disparities and create new ones unless they 
are implemented in a way that allows all 
patients to benefit. In the United States, 
for example, people without jobs experi- 
ence diverse levels of care. A two-tiered 
system in which only special groups or 
those who can pay — and not the poor 
— receive the benefits of advanced
decision-making systems would be
unjust and unfair. It is the joint
responsibility of the government
and those who develop the technology
and support the research to ensure
that AI technologies are distributed equally.

Second, I worry about clinicians’ abil- 
ity to understand and explain the output 
of high-performance AI systems. Most 
health-care providers will not accept a
complex treatment recommendation from
a decision-support system without a clear 
description of how and why it was reached. 

Unfortunately, the better the AI system, 
the harder it often is to explain. The fea- 
tures that contribute to probability-based 
assessments such as Bayesian analyses are 
straightforward to present; deep-learning 
networks, less so. 

AI researchers who create the infra- 
structure and technical capabilities for 
these systems need to engage doctors, 
nurses, patients and others to understand 
how they will be used, and used fairly. 


MANUELA VELOSO
Embrace a robot-
human world


Professor of computer science, 
Carnegie Mellon University 


Humans seamlessly integrate perception, 
cognition and action. We use our sensors 
to assess the state of the world, our brains 
to think and choose actions to achieve 
objectives, and our bodies to execute those 
actions. My research team is trying to build 
robots that are capable of doing the same 
— with artificial sensors (cameras, microphones
and scanners), algorithms and actuators,
which control the mechanisms.
But autonomous robots and humans
differ greatly in their abilities. Robots 
may always have perceptual, cognitive 
and actuation limitations. They might not 
be able to fully perceive a scene, recognize 
or manipulate any object, understand all 
spoken or written language, or navigate in 
any terrain. I think that robots will com- 
plement humans, not supplant them. But 
robots need to know when to ask for help 
and how to express their inner workings. 

To learn more about how robots and 
humans work together, for the past three 
years we have shared our laboratory and 
buildings with four collaborative robots, 
or CoBots, which we developed. The 
robots look a bit like mechanical lecterns. 
They have omnidirectional wheels that 
enable them to steer smoothly around 
obstacles; camera and lidar systems to 
provide depth vision; computers for processing;
screens for communication; and a
basket to carry things in. 

Early on, we realized how challenging 
real environments are for robots. The 
CoBots cannot recognize every object 
they encounter; lacking arms or hands 
they struggle to open doors, pick things 
up or manipulate them. Although they can 
use speech to communicate, they may not 
recognize or understand the meaning of 
words spoken in response. 

We introduced the concept of ‘symbiotic 
autonomy’ to enable robots to ask for help 
from humans or from the Internet. Now, 
robots and humans in our building aid one 
another in overcoming the limitations of 
each other. 

CoBots escort visitors through the 
building or carry objects between loca- 
tions, gathering useful information along 
the way. For example, they can generate 
accurate maps of spaces, showing temper- 
ature, humidity, noise and light levels, or 
WiFi signal strength. We help the robots 
to open doors, press lift buttons, pick up 
objects and follow dialogue by giving 
clarifications. 

There are still hurdles to overcome to 
enable robots and humans to co-exist 
safely and productively. My team is 
researching how people and robots can 
communicate more easily through lan- 
guage and gestures, and how robots and 
people can better match their representa- 
tions of objects, tasks and goals. 

We are also studying how robot appear- 
ance enhances interactions, in particular 
how indicator lights could reveal more of a 
robot’s inner state to humans. For instance, 
if the robot is busy, its lights may be yellow,
but when it is available they are green. 

Although we have a way to go, I believe 
that the future will be a positive one if 
humans and robots can help and complement
each other. SEE INSIGHT P.435

Lithium is produced from brine in Chile’s Atacama Desert.

Knowledge and know-how


Philip Ball examines a study unpicking the broad ramifications of information flows. 


The Atacama desert in Chile is the
world’s biggest single source of
lithium. But Chile does not make its
own lithium batteries; for that, the metal is
exported to countries as distant as South
Korea and China. Why?

“Our world is marked by great inter- 
national differences in countries’ ability 
to crystallize imagination,” writes César
Hidalgo in Why Information Grows: that is, 
in the ability to turn the inventiveness of the 
human mind into concrete, saleable prod- 
ucts that embody information. 

Hidalgo’s book is largely concerned with 
explaining why those differences exist, and 
what the economic consequences are. In 
doing so, Hidalgo, a Chilean physicist at the 
Media Lab of the Massachusetts Institute of 
Technology in Cambridge, aims to embed 
his ideas in a broad view of how informa- 
tion economies work both in society and in 
nature, from Silicon Valley entrepreneurism 
to genetics. 

Why Information Grows: The Evolution of Order, from Atoms to Economies
CÉSAR HIDALGO
Basic: 2015

Information can now be accessed so easily
— you can find out about pretty much
anything from anywhere, or so it sometimes
seems — that it might naively be
expected to reduce discrepancies in the local
know-how needed to “crystallize imagination”.
But differences in the character and performance
of national economies persist in ways that are
not simply the result of inertia;
nor can they be explained solely by different
labour costs. The core of Why Information
Grows is about the acquisition (or not) of the
capability to put information to use. Many
of Chile's recent exports to South Korea were
copper (another resource that Chile has in
abundance). The reverse trade was largely in
vehicles and their parts. The trade balance
lies in Chile's favour financially, but Hidalgo
points out that this masks a deficit in the flow 
of embodied information. That does not, of 
course, mean that Chilean automotive engi- 
neers are ignorant. Rather, the know-how to 
make cars reflects a whole suite of consid- 
erations about capacity: a trained workforce, 
access to materials, distribution channels, 
reputation, trading partners and so on. To 
truly understand these macroeconomic 
transactions, Hidalgo argues, we need to 
delve into the infrastructure and networks 
that make knowledge productive and allow 
know-how to accumulate. 

Part of this understanding has already
been developed by economists, in particular
Ronald Coase, whose 1930s work on transaction
costs and the origin and growth of firms
won him the 1991 economics Nobel. More
recently, social capital — connections, and
the trust they engender — has been recognized
as a key factor in the health of economies.
Hidalgo’s particular contribution here is to
weave these ideas into a theory that provides
a conceptual picture of a society’s collective
accumulation of productive information “in
firms and networks of firms”. He argues that
these entities are not so much the result of that
accumulation as the means by which our societies
have evolved to make it possible.

What emerges is a new measure of economic
complexity that acknowledges not
just gross domestic product, but also diversity
in the types of product that a nation
produces. This index, Hidalgo says, offers
a metric that predicts long-term economic
growth. His thesis here is dense and not
always lucidly explained, but it contains
some innovative thinking about what drives
growth that could help us to navigate the
turbulence of the ever more interconnected
global economy.

Hidalgo wants to take his ideas about
information and its productive exploitation
beyond economics, especially into biology
and ecology. In a nutshell, the task is to
restore notions of meaning to information
theory. The seminal text The Mathematical
Theory of Communication (Univ. Illinois
Press, 1949) by Claude Shannon and Warren
Weaver explicitly expunged the idea that
information in itself has meaning, as Hidalgo
reminds us. But we often persist in thinking
otherwise, not least in the delusions that we
can do science by mining big data without
a guiding theory, or that genomes contain
‘instructions’ for making an organism.

The distinction between knowledge and
know-how, which seems evident enough
in economics, can be profitably made in
the natural sciences too. A gene sequence is
information; know-how refers to the ability
to make something from it, and this cannot
reside in the information itself. “DNA has
no knowhow and cannot unpack itself; it is
a slave to the machinery needed to unpack
it,” says Hidalgo. This seems obvious, but it
is worth restating, if only to remind us that
this unpacking process (and not further
sequencing per se) is now the most important
unanswered question in genomics.

Whether all this really justifies Hidalgo’s
claim to explain “what information is,
where it comes from, and why it grows” is
another matter. For one thing, with quantum
mechanics being reframed as an information
theory, it is unclear whether any classical
arguments can give the whole picture. But
that does not detract from the stimulating
new perspectives on offer. Hidalgo has identified
a fertile seam, and all his book really
needs (apart from some prose-tightening)
is for the title to be turned into a question.


Books in brief

On the Edge: The State and Fate of the World’s Tropical Rainforests

Claude Martin GREYSTONE (2015)

However crucial in terms of biodiversity, climate moderation and the
carbon cycle, rainforests are at bay. Almost one-fifth of the Amazon,
for instance, has been lost over 50 years. In this definitive report to
think tank the Club of Rome, conservation veteran Claude Martin and
contributors cover the territory from the bad old days of error-ridden
global forest monitoring to today’s smart mapping and strategies
for sustainable management. Martin predicts a further 100-million-hectare
loss of primary forest by 2050 — but delivers 17 steps for
averting the worst. Key reading for environmental policy-makers.


Dreams of Earth and Sky 

Freeman Dyson NEW YORK REVIEW BOOKS (2015)

“The big blunders in this book are not accidental but intentional.” So 
physicist Freeman Dyson proclaims of the maverick ideas percolating 
through this collection of his New York Review of Books essays and 
book reviews from 2006 to 2014. Dyson delivers his iconoclasm 
with humour, honesty and off-the-cuff brilliance, whether riffing off 
microbiologist Carl Woese’s concept of living systems by seeing 
both thunderstorms and butterflies as “patterns of organization”, 
imagining ecology enriched by genome designers, or mulling over 
how Mary Shelley’s Frankenstein cast science as “an agent of doom”. 


Asteroids: Relics of Ancient Time 

Michael K. Shepard CAMBRIDGE UNIVERSITY PRESS (2015) 

The chunks of broken protoplanet we call asteroids are potential 
killers as well as scientific gold mines. Both thrills are explored in this 
comprehensive study of these “little worlds”. Planetary geoscientist 
Michael Shepard recounts the history of asteroid spotting, from 
pioneers such as Giuseppe Piazzi — who detected the first, Ceres, 
in 1801 — to today’s Catalina Sky Survey, run by the University of 
Arizona. Interweaving anecdotes from his own work with copious 
technical detail, Shepard leads us on an expert tour of a fecund 
branch of astronomy and its lessons for planetary science. 


Breathing Space: The Natural and Unnatural History of Air 

Mark Everard ZED (2015) 

The air — all 51 quadrillion cubic kilometres of it — is the biggest 
ecosystem on Earth, an atmospheric ocean connecting all life.
Yet as science writer Mark Everard shows in his original study, the
importance of this layered envelope of gases and aerosols is hardly 
reflected in policy. He argues for reassessment, looking in turn at the 
nexus of atmosphere, biosphere and human culture, and abuses 
such as air pollution and climate change. To secure “breathing 
space”, Everard calls for an approach integrating management of 
the atmosphere with that of other Earth systems. 


The Adventures of Sir Thomas Browne in the 21st Century 

Hugh Aldersey-Williams GRANTA (2015) 

A playful erudition permeates this biography of Thomas Browne, the 
seventeenth-century English polymath who wrote the debunking 
classic Pseudodoxia Epidemica, coined 784 new words and had an 
eye for patterns in nature. Hugh Aldersey-Williams zips between
our time and Browne’s, and through the East Anglian landscapes
Browne knew, to reveal the man and his work. Browne emerges as 
an exemplar of synthesized knowledge — and as such, curiously at 
home in today’s cultural remix of science and humanities. Barbara Kiser 


Philip Ball is a writer based in London. His 
latest book is Invisible. 
e-mail: p.ball@btinternet.com 
LABORATORY HISTORY


The chemistry chronicles 


Derek Lowe relishes a celebration of the lab on its long march through time. 


Ask non-scientists to
picture a laboratory,
and the odds are that
they will imagine the kind of
space that a contemporary 
chemist would work in, com- 
plete with lab bench, fume 
cupboard, sink and shelves of 
glassware. Chemistry was prob- 
ably the first of the sciences to 
get a room of its own, and in 
The Matter Factory, Peter Mor- 
ris, keeper of research projects 
at London’s Science Museum, 
offers the first book-length 
treatment of how this happened 
and what has changed in labs 
over the years. 

Laboratory history is 
inseparable from the history 
of chemistry. It takes us from 
scientific instrumentation 
to teaching, from the disci- 
pline’s beginnings as the hidden art of 
alchemy to its modern status as a required 
course in science. It also takes in the rise of 
industrial research, the gradual raising of 
consciousness about safety and the personal 
impact of famous chemists. No discipline is 
immune to fashion, and a luminary — such 
as Robert Bunsen, who co-discovered spec- 
troscopic analysis, or organic-chemistry 
pioneer Justus von Liebig — could set the 
tone for years by building a laboratory to his 
personal specifications. 

As Morris shows, over all this stretches 
culture. Chemistry moved from a broadly 
French and English occupation during 
the Enlightenment, an era of experiments 
on gases and chemical compositions, to a 
German one in the mid-nineteenth century, 
with the Americans eventually beginning 
to take notes and draw up plans of their 
own in the twentieth century. Each nation 
had its own style, which blended with the 
practical aspects of a workplace to create 
distinctive looks. The most useful features 
(such as benches with drawers, and dedi- 
cated lines for gases and steam) are still to 
be found today. 

Morris deftly keeps all these threads from 
getting too tangled. The Matter Factory 
starts in the alchemist’s lair of the medieval 
era, dominated by the largest, hottest fur- 
nace available. The book makes clear that 
many engravings and paintings of alche- 
mists at work must be inaccurate, because 
they were drawn by people suspicious of
the whole enterprise. (Alchemists’ own
drawings, such as those in the seventeenth-century
text Mutus Liber, tended toward the
wildly allegorical and the wilfully obscure.)
The German scientist Georgius Agricola’s
1556 De Re Metallica (On the Nature of
Metals) is probably the first reliable guide
to early lab technique such as the handling
of strong acids. In later illustrations, the furnace
shrinks, then disappears entirely, and
tables and benches appear.

Image caption: Antoine Lavoisier was central to the eighteenth-century chemical revolution.

Fume hoods began to take on a modern 
look by the 1920s, but separate cupboard- 
like spaces for experimentation (at first 
unventilated) go back to at least the mid- 
nineteenth century. Photographs from that 
time show benches and shelving progres- 
sively stretching across the floors and walls, 
and lines for water, gas, steam and (by the 
early twentieth century) electricity thread- 
ing into the picture. 
Teaching labs gradually separate themselves
from research labs, and industrial
labs begin to stand on their own. Along the
way, we chemists lose such amenities as the
chemical museum, which displayed specimens,
compounds and lab equipment. It was a once
common annex to many large laboratories, now
completely forgotten.

The Matter Factory: A History of the Chemistry Laboratory
PETER J. T. MORRIS
Reaktion: 2015.

The nineteenth century saw 
perhaps the greatest number of 
changes in laboratory layout, 
as new instruments such as 
Bunsen burners and new styles 
of working, including team 
research, proliferated. Much 
of the book focuses on this era. 
The pace speeds up noticeably 
in the twentieth century, prob- 
ably because the major features 
of the modern lab were already 
largely in place, down to the 
pegboard over the sink for dry- 
ing glassware. 

Today, one chemistry lab 
tends to look very much like 
another. In my own career, 
I have ‘lived’ in at least 14 
labs, and their similarities are 
much more numerous than 
their differences, which largely come 
down to how instruments such as high- 
performance-liquid-chromatography 
equipment are arranged around the walls. 
If a chemist from, say, the early nineteenth
century entered one, the glassware would 
be familiar, as would the bench and fume 
cupboard, although there might be a few 
cries of “Luxury!” at the sight of dispos- 
able pipettes and electronic balances. (The 
humming boxes around the periphery 
would, of course, be harder to place.) 

The changes to labs of the future will 
probably come down to variations in the 
number and capacity of automated instru- 
ments. But the space will probably look 
broadly similar to what we have now, which 
will no doubt disappoint some industrial 
designers looking to make a big splash. The 
Matter Factory, however, is the story of the 
years (and centuries) during which such 
splashes could be made, when chemistry 
was finding out what it could do. It covers a 
lot of ground, and brings together many old 
drawings, plans and photographs that are 
otherwise scattered through a bewildering 
literature trail. It should remain the defini- 
tive history of the chemistry lab for many 
years. 


Derek Lowe has been working in drug 
discovery labs at several companies 
since 1989, and blogs at In the Pipeline 
(www.pipeline.corante.com). 

e-mail: derekb.lowe@gmail.com 
The mystery of the
microscope in mud 


A rare brass microscope was 
discovered in mud dredged 
from canals in Delft, the 
Netherlands, in December 2014. 
The instrument (pictured, left) 
is thought to have been made 
by Dutch pioneer microscopist 
Antony van Leeuwenhoek 
(1632–1723; see P. Ball
Nature 520, 156; 2015) and
is an important find, if it is
genuine. The last Leeuwenhoek 
microscope in public hands was 
a silver instrument that sold for 
more than US$500,000 in 2009. 

The ten microscopes attributed 
to Leeuwenhoek are diminutive, 
comprising a biconvex lens 
sandwiched between two riveted 
body plates. A positioning screw 
holds a stage block and a pin to 
secure the specimen. A smaller 
screw moves the stage block to 
focus the image. 

The Delft microscope’s design 
and dimensions are comparable 
to those of a Leeuwenhoek 
microscope in the Boerhaave 
Museum in Leiden. Although 
the lens is abraded, it would have 
given about 160× magnification.

The Delft find cannot be a
replica. First, it has a rounded 
body plate — a comparable 
microscope in the Boerhaave 
Museum is rectangular (pictured, 
right), and a copy would have an 
identical configuration. Second, 
my analysis of the screw thread 
at the Cavendish Laboratory at 
the University of Cambridge, UK, 
shows that it is unlike threads 
produced by modern mechanical 
methods. The distance between 
the threads (or pitch) of the 
main screw is 0.9 millimetres, 
comparable with that in authentic 
instruments. 

I am preparing a protocol
through which the production 
details of all existing 
Leeuwenhoek microscopes can 
be scrutinized. Scanning electron 
microscopy should then be able to 
detect any forgeries. 

Brian J. Ford Cardiff University, 
UK. 
brianjford@cardiff.ac.uk 


Deforestation soars 
in the Amazon 


Deforestation of the Brazilian 
Amazon is on the rise and not 
declining as you imply (Nature 
520, 20-23; 2015). Forest 
clearance has exploded since 
August 2014, according to satellite 
data from Brazil's National 
Institute for Space Research and 
the sustainable-development 
organization Imazon. Monthly 
clearance rates are several times 
those for the same months in the 
preceding two years. 

The data on deforestation 
drivers in the studies you discuss 
predate the recent deforestation 
surge (D. Nepstad et al. Science 
344, 1118-1123; 2014 and H. K. 
Gibbs et al. Science 347, 377-378; 
2015). Brazil’s currency, the real, 
is in free-fall, with no end in 
sight. Exchange rates against the 
US dollar mean that exports from 
Brazil are now considerably more 
profitable than last year, when 
the deforestation decline was 
occurring. 

Prospects for the near term are 
not good. A key 2008 resolution 
from the Central Bank of Brazil, 
which links financing from 
public banks to environmental 
compliance, now offers only 
fragile protection in the face 
of a congress dominated by 
those in favour of agribusiness. 
Creation of new protected areas 
has been essentially paralysed, 
existing reserves continue 
to have their official status
removed, and government
expenditure on enforcing 
environmental laws has been cut 
by 72%. Furthermore, political 
appointments are sending an 
anti-environmental signal to 
deforesters (see Nature 517, 
251-252; 2015), and plans for 
Amazonian roads continue as 
fast as money allows. The “battle
for the Amazon” is far from won.
Philip M. Fearnside National 
Institute for Research in Amazonia 
(INPA), Manaus, Brazil. 
pmfearn@inpa.gov.br 


UK bioscientists 
push for crop policy 


In assessing the next generation 
of plant-breeding techniques 
such as gene editing (Nature 
520, 131-132; 2015), the 
European Union (EU) should 
take into account criticisms of 
its current regulatory system for 
new crop varieties. 

Last year’s statement by 
the UK Biotechnology and 
Biological Sciences Research 
Council on genetic techniques 
for crop improvement suggests 
that EU regulations need to 
move beyond production 
methods and focus more on the 
benefits of newly introduced 
genetic traits (see go.nature.com/gsoiza).

The UK National Institutes of 
Bioscience, which contributed 
to the statement, seek prompt 
realization of the societal and
environmental benefits of
these latest technologies. They
are leading efforts to ensure 
that the best policy decisions 
are being made for plant (and 
animal) breeding in Europe and 
worldwide. 

David Hume National Institutes 
of Bioscience, UK. 
david.hume@roslin.ed.ac.uk 


Marine protection 
is a UK priority 


Scotland’s government has 
responsibility for nature 
conservation in 61% of UK 
waters. As programme manager 
for the Marine Conservation 
Society in Scotland, I agree that 
the United Kingdom and its 
devolved governments should 
be making marine conservation 
a priority (see also Nature 520, 
585-586; 2015). 

This summer, Scotland could 
become a global conservation 
leader following proposals for 
Marine Protected Areas (MPAs) 
for highly mobile species, 
including minke whales and 
basking sharks. These areas 
will soon be subject to public 
consultation. The hope is that 
they will then augment the 
30 Scottish MPAs designated 
in summer 2014. Even at some 
of these sites, damaging fishing 
activity may still be continuing 
(see go.nature.com/hzkdzd). 
Present and future governments 
in Scotland and Westminster 
must take steps to keep the ‘P’ in 
MPAs. 

Calum Duncan Marine 
Conservation Society, Edinburgh; 
and Scottish Environment LINK, 
Perth, UK. 
calum.duncan@mcsuk.org 


CORRECTION 

The Correspondence ‘Interim 
initiative for health in Iran’ 
(Nature 521, 32; 2015) wrongly 
implied that A. A. Velayati is a 
volunteer for non-profit group 
IAPA; only H. Jamaati and S. M. 
Hashemian are IAPA volunteers. 
For News & Views online, go to
nature.com/newsandviews 


ARTIFICIAL INTELLIGENCE 


Robots with instincts 


An evolutionary algorithm has been developed that allows robots to adapt to unforeseen change. The robots learn 
behaviours quickly and instinctively by mining the memory of their past achievements. SEE LETTER P.503 


CHRISTOPH ADAMI 


Intelligence, by some accounts, is synonymous
with the ability to predict the future.
Because doing so quickly can often mean
the difference between life and death, our 
brains have evolved to be able to search the vast 
number of potential futures easily. How is such 
a feat accomplished? On page 503 of this issue, 
Cully et al.¹ attempt to answer this question by
demonstrating that robots can learn to recover 
quickly and robustly from physical damage — 
a sudden event that requires them to adopt a 
new behavioural strategy to continue func- 
tioning. The robots (a six-legged mobile robot 
and a robotic arm; Fig. 1) use a trial-and-error 
algorithm that lets them tap into the experi- 
ences they have accumulated over a simulated 
lifetime, to quickly find optimal compensating 
behaviours as if by instinct (see Supplementary 
Video 1 in ref. 1). 

Accurate prediction of events in complex 
environments requires experience, an understanding
of ‘how the world works’, and the
capacity to evaluate one’s own actions in the 
context of those of others. It can be argued 
that the further out in time an organism or 
a machine can make accurate predictions of 
the future, the more intelligent it is. Using this 
definition, even simple organisms have some 
intelligence: microbes such as Escherichia coli, 
for example, make predictions about where 
they must move to find higher concentrations 
of sugars, and squirrels anticipate the winter by 
stashing away nuts. 

Among animals, humans have arguably the 
highest level of intelligence, because we can 
anticipate events hundreds, thousands or even 
millions of years in the future — albeit largely 
in domains that do not involve the actions of 
people, such as planetary orbital dynamics. 
How can we begin to understand the cogni- 
tive underpinnings of a predictive capability 
that is, to a smaller or larger extent, inherent 
in all forms of life on Earth? One way, following
Richard Feynman’s dictum “What I cannot
create I do not understand”, is to recreate
intelligence in a machine or robot. 

Attempts to create robot intelligence have
come and gone with limited success in the past
half-century, and it seems as if the goal of creating
a machine with human-like intelligence
remains elusive, even as great strides are
being made. Notwithstanding the successes
of chess-playing programs, IBM’s artificially
intelligent computer Watson, and the advent
of algorithms for self-driving cars, true robot
intelligence still eludes us.

Figure 1 | Adaptive machines. Cully et al.¹ have designed an algorithm that allows robots to develop
strategies for overcoming the effects of damaged limbs. Two robots were used: a, a hexapod (width
50 centimetres); b, a robotic arm (length 62 cm).

Previous studies in the field of robot cogni- 
tion”’ have suggested that the ability to plan 
future actions hinges on the ability to recreate 
a model of the world inside the robot brain — 
an abstract version, but one that is accurate 
enough for mental trials and errors to quickly 
reveal the best strategy to adopt. But even 
supposing that these model representations* 
can be generated, how can the vast ‘space’ of 
likely future actions be searched quickly and 
efficiently? 

Cully et al. subjected their robots to sev- 
eral different unforeseen changes in the 
machines’ morphology (akin to damage), and 
then asked them to find movement strategies 
that would compensate for the injury. Before 
being injured, the robots used an algorithm to 
establish a baseline of possible actions, which 
they used after injury to try out moves that 
were likely to be successful before deciding 
on any particular compensatory behaviour. 
Even though the range of possible behav- 
iours (the behaviour space) for a robot might 
theoretically be infinite, this baseline can be 
established because, in reality, a robot’s actions 
are constrained by its morphology. 

A hexapod robot such as that studied by 
the authors is controlled by 36 parameters, 
but most of the strategies (sequences of motor 
activations in a 36-dimensional space) make
no sense. Within the robot’s ‘embodiment”® 
— the way in which the robot’s body is real- 
ized — only a small subset of activations can 
follow any particular prior activation. In other 
words, the robot’s embodiment dramatically 
reduces the number of potential strategies, so 
that sensible actions occupy a severely reduced 
behaviour space (think of a line instead of 
a sphere). This reduced space is actually 
searchable in real time. 

The authors created the set of all possible 
behaviours by having each robot perform 
many thousands of motions (sequences of 
motor activations) and recording the ‘fitness 
values’ of each sequence. The fitness could 
be as simple as the distance travelled by the 
robot. Collating this database is time-inten- 
sive, but it is analogous to what happens in 
the natural world, in which living organisms 
have a lifetime to acquire such information. 
The robots synthesized new behaviours from 
this data set using a set of special-purpose 
machine-learning algorithms that assume 
that — even in changed circumstances — 
the actions that are most likely to succeed 
are ‘close’ to other such actions in a suitably 
defined behaviour space. 
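
The search described above can be illustrated with a deliberately small sketch. It is not the algorithm of Cully et al., whose details are not given here; it only assumes, as the text says, that behaviours lying close together in behaviour space tend to succeed or fail together. The function name `adapt`, the one-dimensional behaviour line and the Gaussian `width` are all hypothetical choices made for illustration.

```python
import math


def adapt(expected, measure, good_enough, width=1.0, max_trials=20):
    """Trial-and-error search over a precomputed behaviour map (illustrative only).

    expected:    dict mapping a behaviour (here a number on a 1-D line) to the
                 performance predicted before damage.
    measure:     function that runs a behaviour on the damaged robot and
                 returns the performance actually observed.
    good_enough: performance at which the search stops.
    """
    if not expected:
        raise ValueError("the behaviour map is empty")
    belief = dict(expected)   # current guess of how well each behaviour works
    tried = {}                # behaviour -> observed performance
    for _ in range(max_trials):
        candidates = [b for b in belief if b not in tried]
        if not candidates:
            break
        # Try the behaviour that currently looks most promising.
        behaviour = max(candidates, key=belief.get)
        observed = measure(behaviour)
        tried[behaviour] = observed
        if observed >= good_enough:
            return behaviour, tried
        # Assumption: neighbours of a disappointing behaviour disappoint too,
        # so their predicted performance is lowered in proportion to closeness.
        shortfall = belief[behaviour] - observed
        for other in belief:
            weight = math.exp(-((other - behaviour) ** 2) / (2 * width ** 2))
            belief[other] -= weight * shortfall
    if not tried:
        raise ValueError("no trials were run")
    # Nothing reached the target: fall back on the best behaviour seen.
    return max(tried, key=tried.get), tried
```

In this toy version, a robot whose best pre-damage gaits all rely on a broken leg quickly stops sampling that region of the map, because each failed trial lowers the predicted value of its neighbours.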

Although these machine-learning algo- 
rithms are unlikely to be similar to those 
used, for example, by mammalian cognitive 
systems, they share a common premise: that a 
behaviour space that is dramatically reduced 
through embodiment, and that is learned from 
experience, can be searched quickly through
trial and error. If we return to the analogy of a 
one-dimensional line as opposed to a sphere of 
possible strategies, only two directions have to 
be attempted for the line before the preferred 
direction is clear, whereas in a sphere six direc- 
tions must be sampled. Given that the robot's 
behaviour space is 36-dimensional, it is clear 
that the ‘flattening’ of the space of options can 
have dramatic effects. 
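
The arithmetic behind the line-and-sphere analogy can be written out, assuming that one probe is made forwards and one backwards along each axis (an assumption made here to match the figures of two and six quoted above). The function name `axis_directions` is hypothetical.

```python
def axis_directions(dimensions):
    """Number of axis-aligned directions: forwards and backwards along each axis."""
    if dimensions < 1:
        raise ValueError("dimensions must be at least 1")
    return 2 * dimensions


for d in (1, 3, 36):
    print(d, axis_directions(d))  # prints: 1 2, then 3 6, then 36 72
```

On this counting, an unreduced 36-dimensional space needs 72 probes just to test each axis once, against two for a line.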

Could these intuitive trial-and-error 
strategies be used to discover more-general 
problem-solving methodologies, of the kind 
that require planning in uncertain envi- 
ronments? It is difficult to imagine that the 
method could easily be scaled up to such a
level; this particular algorithm was hand- 
designed by the authors, whereas the ‘algo- 
rithm’ our brains use is the result of millions 
of years of Darwinian tinkering and pruning.

Given the failure of past efforts to design
robots that display the quick, intuitive and
situation-appropriate behaviour of even the
smallest rodents, perhaps it is time to give
up on the idea that we can design brains, and
instead place our hopes in the power of adaptive
and evolutionary algorithms. Indeed,
the core algorithm that generates the map
of possible high-performance behaviours
in Cully and colleagues’ study is inherently
evolutionary, because good strategies are
improved on by replication with variation,
and selection.

We may never understand our brains in
terms of information-processing concepts, but
we do understand how to harness the power of
evolution. We should therefore let evolution
create for us what we do not understand, one
more time.


CELL BIOLOGY


Polarized transport in
the Golgi apparatus


Proteins can be transported in either direction across a cellular organelle called
the Golgi apparatus. It emerges that CDC42, a molecule that confers cell polarity,
acts to control the directionality of transport in the Golgi. SEE LETTER P.529


AKIHIKO NAKANO


Polarity is a part of life on all scales, from
head-to-tail polarity in whole organisms
to polar orientation in cells. Even intracellular
organelles, such as the Golgi apparatus,
can be polarized. Proteins synthesized by
another organelle, the endoplasmic reticulum,
mainly enter the Golgi on one side (dubbed
the cis side) and, after appropriate modifications,
exit from the opposite, trans, side.
On page 529 of this issue, Park et al.¹ identify
CDC42 as a protein that regulates the directionality
of polarized protein transport within
the Golgi apparatus.

The Golgi apparatus, which prepares proteins
for shuttling to their final destination in
the cell, is a mysterious organelle. More than
100 years have passed since its discovery, but
there are still many controversies regarding the
mechanism that underpins its function. One
debate famous among cell biologists concerns
how cargoes such as secretory proteins are
transported across the polarized structure of
the Golgi from cis to trans” (the anterograde
direction). Various models have been put forward
to explain this directionality, but several
lines of evidence** support ‘cisternal maturation’.
In this model, flattened, membrane-bound
sacs called cisternae, which make up the
Golgi, form on the cis side of the organelle and
move towards the trans side as they mature,
carrying proteins produced by the endoplasmic
reticulum with them as they go.
However, this mechanism alone cannot
explain complicated protein-sorting events
in the Golgi; these mediate transport not only 
from cis to trans, but also in the opposite direc- 
tion. Such retrograde transport involves vesi- 
cles coated in coat protein complex I (COPI)** 
that shuttle between cisternae. But it is unclear 
whether these vesicles are involved in antero- 
grade cargo transport. Tubular structures that 
connect cisternae have also been posited to be 
involved in intra-Golgi transport’ (Fig. 1). 
Previous work by the group that under- 
took the current study revealed that the COPI 
coat regulates not only vesicle formation but 
also the formation of tubules in the Golgi 
— although how it does this is not known. 
Park et al. demonstrated that COPI binds to 
proteins being transported in both the
anterograde and retrograde directions, recognizing
different structural signals on each
type of cargo. Binding seems to be required for
efficient transport of both types of cargo.

Figure 1 | Bidirectional transport. The Golgi apparatus, which contains a stack of membrane-bound
sacs called cisternae, prepares proteins for shuttling around the cell. Proteins mainly move through the
Golgi in the anterograde direction, from the cis to the trans side, travelling from their site of synthesis in
the endoplasmic reticulum towards the cell membrane. However, proteins can also move in the opposite,
retrograde, direction. Proteins can be transported across the Golgi in three ways: through ‘cisternal
maturation’, in which protein-containing cisternae move from cis to trans as they mature; in vesicles
coated in coat protein complex I (COPI), which mediate retrograde transport; and in tubules, in which
COPI binds to proteins to promote transport in either direction. Park et al.¹ report that the CDC42
protein competes with COPI for binding of retrograde, but not anterograde, cargoes in tubules, and so
enhances anterograde and inhibits retrograde transport.

Small GTPase proteins act as molecular 
switches for a wide variety of cellular functions. 
One small GTPase, CDC42, was originally 
documented to control cell division in budding 
yeast, and has since been found to regulate cell 
polarity’. The authors investigated the pos- 
sibility that CDC42 regulates the selectivity of 
COPI binding, because it is already known to 
interact with COPI to modulate anterograde 
cargo transport in the Golgi®. They found that 
an activated mutant form of CDC42 acceler- 
ated anterograde transport in the Golgi, but 
repressed retrograde transport. By contrast, 
reducing levels of CDC42 activity inhi- 
bited anterograde, but enhanced retrograde, 
transport. 

Next, the authors analysed the role of 
CDC42 in transport through the Golgi tubules 
specifically. Using a ‘reconstitution system’
designed to generate COPI-coated tubules’, 
Park et al. showed that the activated form of 
CDC42 reduced the level of retrograde trans- 
port by preventing COPI from binding to 
retrograde cargoes in tubules. However, the 
mutant protein had no effect on anterograde 
cargoes. The researchers demonstrated that 
CDC42 imparts sorting specificity to Golgi 
tubules in two ways: first, through two lysine 
amino-acid residues in its carboxy-terminal 
end that compete with retrograde, but not 
anterograde, cargoes for binding to COPI; 
and second, by promoting the formation of 
tubules at the expense of COPI-coated vesicles, 
thus reducing retrograde transport. Finally, an 
analysis of small GTPases, including RhoA, 
Rac1 and some Rab and Arf GTPases, revealed
no others that could affect the directionality 
of intra-Golgi transport, demonstrating the 
specificity of CDC42. 

If, as these results indicate, CDC42 does 
indeed modulate the sorting of anterograde 
and retrograde cargoes in the Golgi tubules, 
then this study takes an important step towards 
providing a fuller understanding of the exact 
mechanism that underpins intra-Golgi 
transport. CDC42 regulates cellular polarity 
through its intimate interaction with actin fila- 
ments, which help to maintain the structure 
of the cell’. The role of CDC42 in the Golgi 
tubules probably does not directly involve 
actin, and so how the protein regulates direc- 
tionality in this setting is a question for the 
future. Other, as-yet-undefined factors must 
be acting with CDC42. 

Whether the tubules containing anterograde 
cargoes do indeed connect Golgi cisternae 
in anterograde transport also remains to be 
definitively proved. The authors suggest that 
SRC tyrosine kinase, an enzyme frequently 
mutated in cancer, modulates CDC42, which 
leads them to hypothesize that there might be 
a relationship between intra-Golgi transport 
and tumour development. But, ultimately, the
cue that switches on anterograde transport 
remains elusive. 

CDC42 is evolutionarily conserved, being
found in organisms from yeast to mammals. 
Plants also have many Rho-family GTPases. 
Whether these related proteins play simi- 
lar parts to CDC42 in cargo binding, tubule 
formation and directionality in the Golgi 
would be of great interest to those working 
on the organelle. Trafficking pathways are 
thought to have undergone parallel evolution 
— have these proteins remained fundamental 
players?
The origins of a climate oscillation


An index of water-circulation strength in the North Atlantic Ocean has been
derived from sea-level measurements. This provides fresh evidence of the 
ocean’s leading role in multidecadal climate variability. SEE LETTER P.508 


SERGEY K. GULEV & MOJIB LATIF 


The Earth has warmed considerably
during the twentieth and twenty-first
centuries, most probably because of
the effects of greenhouse gases emitted as a
result of human activities. But strong natural
variability on a wide range of timescales, from
monthly to multidecadal, is superimposed on
the long-term global-warming trend and also
causes considerable spatial variation of that
trend. We therefore need to understand this
variability to discriminate anthropogenic
effects from natural climate forcing. On page
508 of this issue, McCarthy et al.¹ report a
method that enables the effects of ocean
circulation on one of the most prominent
examples of long-term climate variability,
the Atlantic Multidecadal Oscillation (AMO),
to be identified from long-term sea-level
data.

Natural climate variability can be generated 
internally by interactions within or between 
climate-system components such as the atmos- 
phere, ocean and sea ice, and externally by fac- 
tors such as volcanic eruptions. The AMO is 
an example of long-term climate variability 
associated with the ocean”. It represents quasi- 
periodic oscillations of sea-surface tempera- 
ture (SST) in the North Atlantic Ocean that 
have a period of about 70 years. The AMO 
has been described from instrumental SST 
records” going back to the mid-nineteenth
century and reconstructed from proxy data* 
for the past few centuries. 

Climate models” and analysis of surface 
heat fluxes* have suggested that the AMO is 
an internal mode of climate variability origi- 
nating from changes in the circulation of the 
Atlantic Ocean, but its origin is still debated. 
Signals seen in SST may derive from changes 
in the ocean’s interior that drive circulation 
changes’, or from the influence of factors out- 
side the ocean’. Lack of data from the ocean's 
subsurface hinders attempts to quantify the 
relative contributions of internal and external 
processes to the AMO. 

McCarthy and colleagues’ method for iden- 
tifying how ocean circulation affects the AMO 
is based on the hypothesis that ocean currents 
on relatively large scales of space and time (sev- 
eral tens of kilometres and several weeks) are 
broadly geostrophic — that is, the direction 
and strength of the currents depend on the
balance between the Coriolis force associated 
with Earth’s rotation and the horizontal pres- 
sure-gradient force in the ocean (Fig. 1). Their 
approach builds on the proposal’ that the dif- 
ference in dynamic height (the sea level caused 
by variations in the ocean's depth-integrated 
density) between the centres of large, primar- 
ily wind-driven, ocean-circulation systems 
called gyres in the subtropical and subpolar 
North Atlantic provides a proxy for long-term 
changes in mass transport by the Gulf Stream
and its extension, the North Atlantic Current.
The difference in heights is a measure of the
pressure gradient between the gyres’ centres
and has previously been estimated’ from offshore
sea-level records in the subtropics at
Bermuda, and from long-term hydrographic
data at subpolar latitudes.

Figure 1 | Circulation and pressures in the North Atlantic Ocean. The map depicts the mean dynamic
topography (the oceanic relief that corresponds to long-term, averaged ocean circulation) in the
North Atlantic for the years 1993 to 2013. Positive and negative values represent heights above and below
the level of the unperturbed ocean, respectively. The topography is a measure of ocean pressure: low
pressure occurs in the subpolar-gyre region (blue) and high pressure in the subtropical-gyre region (red).
Short black lines indicate local current velocity vectors and white arrows indicate the main ocean currents,
which follow maximum pressure gradients. McCarthy et al.¹ report a method for identifying how ocean
circulation affects the Atlantic Multidecadal Oscillation, the quasi-periodic oscillations of sea surface
temperature in the North Atlantic that have a period of about 70 years. Their approach assumes that the
difference in heights between the centres of the gyres provides a proxy for long-term changes in mass
transport in the North Atlantic. Figure adapted from CNES/CLS 2012 (produced by AVISO).
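
The geostrophic balance invoked above has a standard textbook form: the surface current speed is v = (g / f) × (Δη / Δx), where Δη is the sea-level difference across a horizontal distance Δx, g is gravitational acceleration and f = 2Ω sin(latitude) is the Coriolis parameter. A minimal sketch of that relation follows. The function names are hypothetical, and the numbers in the usage line are illustrative, not values from McCarthy and colleagues’ study.

```python
import math

OMEGA = 7.2921e-5  # Earth's rotation rate, radians per second
G = 9.81           # gravitational acceleration, metres per second squared


def coriolis_parameter(latitude_deg):
    """Coriolis parameter f = 2 * Omega * sin(latitude)."""
    return 2.0 * OMEGA * math.sin(math.radians(latitude_deg))


def geostrophic_speed(height_difference_m, distance_m, latitude_deg):
    """Surface current speed (m/s) implied by a sea-level difference.

    The balance breaks down near the Equator, where f approaches zero.
    """
    f = coriolis_parameter(latitude_deg)
    if f == 0.0:
        raise ValueError("geostrophic balance is undefined at the Equator")
    return G * height_difference_m / (f * distance_m)


# Illustrative only: a 1-metre sea-level difference across 100 km at 35 degrees N.
print(round(geostrophic_speed(1.0, 100_000.0, 35.0), 2), "m/s")
```

The steeper the sea-level slope, the faster the current, which is why a height difference between the gyres can stand in for the strength of the flow between them.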

But the Bermuda sea-level data are noisy;
coastal tide gauges would provide more-robust
signals. By taking sea-level differences
between coastal tide gauges to the south
and north of Cape Hatteras — the boundary
between the subtropical and subpolar gyres
— as an estimate of the pressure gradient
between the gyres, McCarthy and co-workers
derived a simple index of the North Atlantic
circulation strength (see Fig. 1 of the paper)
that closely correlates with heat transport in
ocean models. Positive values of this index
reflect an enhanced pressure gradient compared
with the long-term mean and imply
strong northward heat transport and increasing
upper-ocean heat content in mid-latitude
and subpolar regions of the North Atlantic.
Negative values imply lowering of the pressure
gradient, weakening of northward heat
transport and diminishing heat content
in the upper ocean. Although this index is
influenced by many other factors in the Gulf
Stream region, it seems to be an effective
proxy for the AMO.

McCarthy et al. also find that fluctuations 
in the sea-level index (and therefore changes 
in ocean circulation) precede anomalies in 
upper-ocean heat content in the subpolar-gyre 
region by approximately two years. This time 
lag is important, because it potentially ena- 
bles prediction of heat-content changes in the 
subpolar-gyre region from sea-level data, and 
thus prediction of AMO evolution and related 
climate anomalies. 
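
The two ideas above, an index built from a sea-level difference and a lead time between that index and heat content, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not McCarthy and colleagues' procedure: the function names, the south-minus-north sign convention, the plain Pearson correlation and the idea of evenly spaced (for example annual) samples are all assumptions made here.

```python
from statistics import mean


def sea_level_index(south, north):
    """Sea-level difference (south minus north, an assumed convention),
    expressed as an anomaly from its own long-term mean."""
    diff = [s - n for s, n in zip(south, north)]
    base = mean(diff)
    return [d - base for d in diff]


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant series."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def best_lead(index, heat, max_lag):
    """Lag (in samples) at which `index` correlates best with later `heat`."""
    scores = {}
    for lag in range(max_lag + 1):
        x = index[: len(index) - lag]   # earlier index values
        y = heat[lag:]                  # heat content `lag` samples later
        scores[lag] = pearson(x, y)
    return max(scores, key=scores.get)
```

With annual samples, a returned lag of 2 would correspond to the index leading heat content by about two years. A real analysis would also need filtering, significance testing and treatment of gaps in the tide-gauge records, none of which is attempted here.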

The authors also used a numerical model 
of the ocean (an eddy-permitting ocean gen- 
eral circulation model, for those in the know), 
forced by observed atmospheric conditions at 
the surface, to investigate the interconnections 
between dynamic sea-level gradients, circula- 
tion strength and upper-ocean heat content, 
including time lags. Despite the limitations 
of such models, especially in the Gulf Stream 
region and at high latitudes, the simulations 
largely support the results from the data 
(Fig. 2 of ref. 1), which is reassuring. 

McCarthy and colleagues’ study supports 
previous conjectures that the ocean integrates
chaotic atmospheric variability primarily asso- 
ciated with a climatic phenomenon known as 


© 2015 Macmillan Publishers Limited. All rights reserved 




the North Atlantic Oscillation, and responds
by generating SST variability on multidecadal 
time scales through dynamical ocean processes 
and with a time lag of about a decade — spe- 
cifically by causing changes in the large-scale 
ocean circulation called the Atlantic Meridi- 
onal Overturning Circulation (AMOC). The 
findings therefore further reinforce the idea 
that the AMO is an internal mode of climate 
variability. 

The work also has crucial implications for 
our understanding of the ocean’s role in cli- 
mate variability and change. First, if the AMO 
originates from ocean dynamics, then attempts 
to detect early signs of the effects of anthropo- 
genic climate change in the North Atlantic will 
be hampered. For example, the climate effects 
of AMOC slowdown projected by many cli- 
mate models in response to global warming 
could be masked for decades by a positive 
phase of the AMO. Second, the AMO has been 
linked to climate anomalies in many regions of 
the globe. The study therefore suggests that
decadal predictions of climate over the North 
Atlantic and adjacent continents are possible 
if enough ocean data are available for forecast 
initialization. This would be of enormous 
societal benefit to many countries. 

Further efforts are needed to conclusively 
prove that the ocean steers the AMO. Rel- 
evant instrumental records (including sea 
levels from tide gauges) are available only 
for the past several decades, with the longest
dating back to the 1920s and 1930s. But a
thorough assessment of the ocean's role in the 
multidecadal climate variability of the North 
Atlantic sector will require proxy data dating 
back for many centuries or even millennia. 
Multi-year experiments to measure ocean 
currents and cross-sections — using moored 
measuring devices, satellite data, and buoys 
that sample the upper ocean — have enabled 
studies over the past two decades and have 
captured long-term data. These programmes 
must continue, given the long timescales 
involved in the AMO. Climate models initial- 
ized with such data will provide more-reliable 
predictions of future climate than are cur- 
rently available. 

The synthesis of models and data will poten- 
tially allow the prediction of multidecadal 
variations. However, climate models still suf- 
fer from large biases, especially in the North 
Atlantic. Improvements to climate models 
are therefore also urgently needed. But it 
remains to be seen whether atmosphere- 
ocean interactions such as those described by 
McCarthy et al. will remain the same under 
anthropogenic climate change.
CANCER METABOLISM


A waste of insulin 
interference 


Many people with cancer die from a wasting disorder called cancer-associated 
cachexia. Two studies in fruit flies show that inhibition of insulin signalling 
causes cachexia-like organ wasting. 


ERWIN F. WAGNER & MICHELE PETRUZZELLI 


It is estimated that up to 30% of people with
advanced-stage cancer are killed not by the
tumour itself, but by a metabolic disorder
called cancer-associated cachexia (CAC),
which is characterized by systemic inflammation,
weight loss, body-fat atrophy and skeletal-muscle
wasting. Cachexia is also a feature
of several chronic conditions, including
heart failure, lung disease and infectious diseases
such as HIV. At present there is no cure
for CAC and no biomarkers are available for
identifying patients at risk of developing the
disorder. Thus, there is a need to better under- 
stand the origins of cachexia, its systemic pro- 
gression and the molecular pathways involved 
in its development. Two papers in Developmental
Cell now report that interference with
insulin signalling in the fruit fly Drosophila 
melanogaster induces systemic organ wasting 
reminiscent of human cachexia. 

The cancer-causing protein Yap1 is part of
the Hippo signalling pathway, which induces
cell proliferation, in part by increasing signalling
by insulin and by a related protein,
insulin-like growth factor 1 (IGF1) (ref. 6).
Kwon et al. investigated the systemic changes
caused by intestinal activation of Yorkie, an
equivalent of Yap1, in fruit flies. The authors
report that activation of Yorkie in the gut
causes over-proliferation of intestinal cells,
and leads to the secretion of ImpL2, an insulin-like
growth factor binding protein (IGFBP) that
inhibits insulin and IGF1 signalling. Secretion
of ImpL2 in the intestine caused systemic
wasting in muscles and distant organs, including
the ovaries and the fat body (an organ
analogous to vertebrate fat and the liver). This
remarkable finding implies that over-proliferation
in a given tissue can lead to distant metabolic
changes and wasting symptoms, similar
to human cachexia.

Figure 1 | Systemic metabolic changes in cancer. Cancer-associated cachexia is defined by harmful
abnormalities that are brought about by factors secreted by the growing tumour, and by adaptive
responses in host tissues. Solid double-headed arrows indicate that the molecular mechanisms
underpinning the trait (which are thought to be bidirectional) have been defined, whereas dashed
double-headed arrows indicate that causative factors remain unknown. Kwon et al. and Figueroa-Clarevega
and Bilder demonstrated that the protein ImpL2, which is secreted from hyperproliferating tissues and
tumours, impairs insulin signalling in fruit flies. This interference is the cause of atrophy in skeletal
muscle, fat and gonadal tissues (indicated by the blue arrows, which are unidirectional to reflect the fact
that bidirectional interactions have little time to become established in the fly studies).

Using a different fly model, Figueroa-Clarevega
and Bilder find evidence to
corroborate Kwon and colleagues’ study. 
They transplanted tumours into adult flies, 
and found that malignant, but not benign, 
tumours caused wasting of fat, muscle and 
gonadal tissue. Furthermore, the tumours 
induced down-regulation of the insulin sig- 
nalling pathway in peripheral organs, leading 
to insulin resistance in these regions. Inves- 
tigating the differences between benign and 
malignant tumours, the authors found that 
ImpL2 was one of the genes most upregulated 
in malignant tumours. They demonstrated 
that overexpression of ImpL2 in specific tis- 
sues in flies without tumours led to wasting 
in distant locations. However, inhibition of 
ImpL2 in flies with tumour transplants only 
partially ameliorated wasting, suggesting that 
other molecular interactions between the 
tumour and the host remain to be discovered. 

Notably, the systemic wasting traits 
described in the two papers correlate with 
other features that are associated with high 
blood sugar. Furthermore, this wasting is 
independent of changes in food intake or local 
damage to organs at the site of tumour growth. 
This, taken together with the other results, 
indicates that factors secreted from tumours or 
hyper-proliferating tissues in D. melanogaster 
perturb systemic metabolism and induce wast- 
ing in distant organs. 

Insulin resistance is common in patients 
with cancer, and may contribute to skeletal-muscle
wasting in mouse models of CAC.
Previous studies of how tumour-derived factors
affect the organism have primarily focused
on a group of cell-signalling molecules called 
pro-inflammatory cytokines, which promote 
systemic inflammation. Alterations in systemic 


metabolism that arise as a consequence of 
tumour growth have been regarded as sec- 
ondary effects of inflammation. The current 
studies provide evidence that impaired insulin 
signalling is itself a direct cause of CAC devel- 
opment. However, whether insulin resist- 
ance is sufficient to cause cachexia in model 
systems other than fruit flies remains to be 
demonstrated. 

Insulin and IGF1 signalling are key regula- 
tors of tissue mass in both flies and mammals, 
and it is possible that IGFBPs are differentially 
regulated in cancers that are associated with 
cachexia compared with those that are not. 
Little is known about the regulation of ImpL2 
in flies and mice, but it is probable that stress 
factors, which activate inflammatory pathways 
and Hippo signalling, could induce ImpL2
expression. It is interesting that other tumour- 
specific proteins, such as the Drosophila 
cytokine Upd2, are unable to induce organ 
wasting in flies, whereas the equivalent protein
in mice, IL-6, is a mediator of CAC. Surprisingly,
a role for the immune system is not
discussed in the two papers despite inflammation
being an accepted hallmark of cachexia.
These two studies highlight the importance 
of studying the metabolic response to cancer. 
Although our knowledge of the metabolism 
of cancer cells themselves is steadily improv- 
ing, the characterization of organism-wide 
metabolic changes in response to cancer is 
still incomplete (Fig. 1). The focus of can- 
cer research is gradually expanding, from 
the cancer cell to the tumour microenviron- 
ment, to the system as a whole. Supporting 
the need to study organism-wide metabolism, 
abnormal alterations in organs at a distance
from the primary tumour that are independent
of the process of cancer metastasis have
been described. Remarkably, targeting
such alterations has therapeutic value in
mice, ameliorating total body-weight loss
and skeletal-muscle and adipose-tissue atrophy,
without directly affecting the mass of
the tumour. The tiny fruit fly nicely illustrates
the value of broadening our horizons
to encompass the organism as a whole, and
of using animal models of cancer to explore
this macroenvironment.


Taking the bad
with the good


Modelling of the interactions between antibiotic production and antibiotic
degradation reveals that these opposing activities are key to maintaining
diversity in microbial communities. SEE LETTER P.516


CARL T. BERGSTROM & BENJAMIN KERR


We commonly expect competitive ecological
interactions to be transitive:
if ravens displace crows and crows
displace jays, then ravens should displace jays
as well. But the world does not always work
this way. Increasingly, researchers are finding
intransitive relationships, in which ravens
displace crows, which displace jays, which in
turn displace ravens. Intransitive relationships
occur in animals, plants and microbes.
Theoretical models show that species abundances
can cycle in intransitive communities,
in principle preserving species diversity.
However, in finite populations, extinction can
readily occur when one type cycles to low abundance.
On page 516 of this issue, Kelsic et al.
model an intransitive system in which microbial
species produce antimicrobial compounds and
exhibit differing sensitivities to the products of
their competitors. By demonstrating that antimicrobial
degradation can stabilize a multi-species
community, the authors suggest a new
solution to the puzzle of how bacterial diversity
is maintained.


To illustrate Kelsic and colleagues’ model,
consider the rock-paper-scissors (RPS) scenario
familiar to many as a game. Imagine
three microbial species called Rock, Paper
and Scissors, each of which produces a unique
antimicrobial compound and is immune to
its own toxin. If Rock kills Scissors, Scissors
kills Paper and Paper kills Rock, we have the
standard situation (Fig. 1a). Each bacterial
species must protect itself from the toxin of
its victim; for example, Scissors protects itself
from Paper’s toxin. Kelsic et al. focused on a
neglected aspect of this protection: it may be
non-excludable, meaning that protection may
spill over to other species.

Such ‘leakiness’ may occur if a cell degrades
the antimicrobials of a competing species by
secreting enzymes that do the job externally,
or by deactivating the competitor’s antimicrobials
once they have entered the cell. Either
way, the concentration of the antimicrobial in
the environment is reduced. The RPS scenario
can be adapted to account for this leakiness.
For instance, when Scissors protects itself from
Paper's toxin, partial protection would extend
to Rock as well — here, Scissors inadvertently
helps its own enemy (Fig. 1b, orange line).
Kelsic and colleagues develop a mathematical
model showing that the ‘public good’ of
leaky protection and the ‘public bad’ of toxin
production can interact to permit stable
coexistence between multiple species.

Models with diffusible public goods and 
public bads are complicated. To improve our 
intuition about Kelsic and colleagues’ model, 
let us reframe this population-level interaction 
as a two-player game. Instead of having many 
toxic microbes interacting with one another 
simultaneously, we consider pairwise interactions
between individuals, each of whom
plays one of the RPS strategies. Players meet at
random and receive a pay-off of 1 for winning 
or drawing and 0 for losing. They then replicate 
according to their pay-offs, and faithfully pass 
on their strategies to their offspring. A popula- 
tion of individuals playing this game undergoes 
unstable cycles; this scenario corresponds to 
the mathematical model developed by Kelsic 
et al. when there is no leaky protection. If 
instead the pay-off for winning is 2, draw- 
ing is 1 and losing is 0, we have the RPS game 
more commonly studied in the literature; this 
version has neutrally stable cycles. Whether
the cycles are neutrally stable or unstable, two
of the three strategies will eventually be lost in
a finite population (Fig. 1a).
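
The difference between the two pay-off schemes can be checked with a short calculation. The Python sketch below assumes continuous-time replicator dynamics, which is one common way to formalize 'replicate according to their pay-offs' and not necessarily the formulation used by Kelsic et al. Under that assumption, the rate of change of ln(x_R x_P x_S) equals the sum over strategies of (pay-off minus mean pay-off). A value of zero means the product of the three frequencies is conserved (neutral cycles); a negative value means the population drifts towards the edge, where types are lost. The names BEATS, payoff_matrix and log_product_drift are illustrative.

```python
# Strategy order: 0 = Rock, 1 = Paper, 2 = Scissors.
# BEATS[i] is the strategy that i defeats.
BEATS = {0: 2, 1: 0, 2: 1}


def payoff_matrix(win, draw, lose):
    """A[i][j] is the pay-off to strategy i when it meets strategy j."""
    return [
        [draw if i == j else win if BEATS[i] == j else lose for j in range(3)]
        for i in range(3)
    ]


def fitness(x, A):
    """Expected pay-off of each strategy under random pairing."""
    return [sum(A[i][j] * x[j] for j in range(3)) for i in range(3)]


def log_product_drift(x, win, draw, lose):
    """d/dt ln(x0*x1*x2) under continuous-time replicator dynamics."""
    f = fitness(x, payoff_matrix(win, draw, lose))
    mean_f = sum(xi * fi for xi, fi in zip(x, f))
    return sum(fi - mean_f for fi in f)


x = [0.5, 0.3, 0.2]
print(log_product_drift(x, win=2, draw=1, lose=0))  # conserved product
print(log_product_drift(x, win=1, draw=1, lose=0))  # shrinking product
```

For the 2/1/0 scheme the drift vanishes at every mixed state, whereas for the 1/1/0 scheme it is negative everywhere except at the point where all three frequencies equal one-third, which is consistent with the neutral and unstable cycles described above.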

Now suppose that we add one or more
‘bystanders’ to the game. Within the pair, a
would-be winner is ineffectual if its enemy
is standing by, and the game ends in a draw.
With one or two bystanders, randomly chosen
from the population, the chances of having
a bystander that can interfere are too low to
alter the dynamics qualitatively. But with more
bystanders, the dynamics change completely:
the community can approach a stable balance
of the three strategies, in which all three types
can coexist indefinitely (Fig. 1b). The more
bystanders that are present, the easier it is for
coexistence to be maintained.

Figure 1 | Cyclical dynamics in a ‘rock-paper-scissors’ game with public goods. a, In a standard
rock-paper-scissors game, rock beats scissors, which beats paper, which beats rock. Here, Rock, Paper
and Scissors represent bacterial species, each of which produces an antimicrobial toxin (black lines). Each
species is immune to the toxin produced by its victim. Kelsic et al. show that, when such immunity is not
‘leaky’, the proportions of the three species fluctuate in unstable cycles until two of the species are lost.
b, When immunity is conferred by antibiotic degradation, this protection may spill over to other species
(coloured lines). When such ‘public goods’ are produced in appropriate quantities, the cycles dampen in
amplitude and all three types coexist stably.

Why do bystanders facilitate coexistence? In 
the standard RPS game, each type directly hurts 
its victim and thereby indirectly helps its enemy 
(its victim's victim). This process occurs in the 
bystander version of the game as well, but in 
addition, each type directly helps its enemy. As 
a player, Paper defeats a Rock partner; and as 
a bystander, Paper prevents Rock from defeat- 
ing Scissors. Thus, Paper helps Scissors in two 
ways: by reducing the proportion of Rock 
players overall (the indirect route) and by 
ameliorating the harm that Rock does to 
Scissors when they meet (the direct route). If 
one type helps its enemy enough by the direct 
route, the additional source of feedback halts 
the proliferation of any type that becomes com- 
mon, and can stabilize the entire community. 
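
The stabilizing effect of bystanders can be explored numerically. The sketch below is one possible formalization of the bystander game, not the authors' model: it assumes an effectively infinite population, bystanders drawn independently at random, a pay-off of 1 for a win or draw and 0 for a loss, and continuous-time replicator dynamics integrated with a simple Euler step. Under those assumptions a player's expected pay-off is 1 minus the chance of meeting its enemy with no protecting bystander present, that is, 1 - x_enemy * (1 - x_victim)^n. All names, the step size and the starting point are illustrative choices.

```python
# Strategy order: 0 = Rock, 1 = Paper, 2 = Scissors.
ENEMY = {0: 1, 1: 2, 2: 0}    # the type that defeats each strategy
VICTIM = {0: 2, 1: 0, 2: 1}   # the type that each strategy defeats


def fitness(x, n_bystanders):
    """Expected pay-off: a player loses only when it meets its enemy and
    none of the bystanders is the enemy's own enemy (the player's victim)."""
    return [
        1.0 - x[ENEMY[i]] * (1.0 - x[VICTIM[i]]) ** n_bystanders
        for i in range(3)
    ]


def simulate(x0, n_bystanders, dt=0.01, steps=30000):
    """Euler integration of replicator dynamics; returns final frequencies."""
    x = list(x0)
    for _ in range(steps):
        f = fitness(x, n_bystanders)
        mean_f = sum(xi * fi for xi, fi in zip(x, f))
        x = [xi + dt * xi * (fi - mean_f) for xi, fi in zip(x, f)]
        total = sum(x)
        x = [xi / total for xi in x]   # guard against rounding drift
    return x


def distance_from_centre(x):
    """Largest deviation of any frequency from one-third."""
    return max(abs(xi - 1.0 / 3.0) for xi in x)


start = [0.36, 0.33, 0.31]
for n in (0, 5):
    print(n, distance_from_centre(simulate(start, n)))
```

In this toy version, a small perturbation from equal frequencies grows when there are no bystanders and decays when there are five, mirroring the qualitative change described in the text. The starting point is deliberately close to the centre; strongly uneven starting proportions are not explored here.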

By adding bystanders to the basic RPS game, 
the model captures the essential biological 
feature of bacterial interactions that drives 
Kelsic and colleagues’ findings: that bacteria 
of a first species protect bacteria of a second 
one when they protect themselves by disabling 
the antimicrobial compound of a third. Thus, 
addition of bystanders captures the leakiness 
of antimicrobial protection. (For a full descrip- 
tion of the bystander model and its dynamics, 
see go.nature.com/yjzkfm.) 

Kelsic et al. make several important con- 
tributions to our understanding of microbial 
community dynamics. They demonstrate 
experimentally that one microbe can pro- 
tect a second from a third. They demonstrate 
mathematically how bystander protection can 
stabilize multi-species bacterial communities.
They show that their qualitative findings are 
robust to variation in the form of the model, its 
parameters, the community structure and the 
number of species. Finally, they illustrate that 
the evolution of new strategies need not disrupt 
stable communities of this sort. In particular, 
coexistence can be robust against ‘cheaters’: 
mutants that obtain a fitness benefit by ceasing 
to produce public goods or public bads. 

The authors also acknowledge several 
caveats to their work. If bystander protec- 
tion is excessively weak or excessively strong, 
species cannot coexist indefinitely. Even if 
the protection is just right, coexistence will 
not occur if the starting proportions of each
species are too uneven. 

Kelsic et al. have derived a key theoretical 
result: that, in principle, multiple microbial 
species can coexist indefinitely even in a well- 
mixed environment without spatial refugia. 
The obvious next step is to return to experi- 
mental studies to test whether this mechanism 
can support bacterial diversity in practice. Is 
protection strong enough to stabilize coexist- 
ence, and can a well-mixed three-species sys- 
tem coexist stably under laboratory conditions? 
Coexistence can withstand the emergence of 
cheaters in the model; can it do so in microbial 
culture as well? We are eager to learn whether, 
in natural microbial communities with intran- 
sitive competition, diversity is promoted when 
public goods take out public bads.
The middle Pliocene
gets crowded 


New hominin fossils discovered in Ethiopia, dated to between 3.5 million and 
3.3 million years ago, suggest that species diversity may have been as high during 
early human evolution as in later periods. SEE ARTICLE P.483 


FRED SPOOR 


For many years, human evolution was
viewed as a diverse radiation of species
emerging a little after 3 million years
(Myr) ago following an earlier phase characterized
by little or no diversity (Fig. 1). Best
known from this earlier period is the middle
Pliocene species Australopithecus afarensis,
which is documented by fossils between
3.7 and 3.0 Myr old from eastern Africa,
including the emblematic skeleton known as
Lucy. On page 483 of this issue, Haile-Selassie
et al. describe a new species, Australopithecus
deyiremeda, which adds to other evidence
indicating that A. afarensis was not the only 
ancient human relative during this time 
period. Indeed, it seems that the hominins that 
populated Africa in the middle Pliocene may 
have been just as diverse taxonomically as later 
stages of human evolution. 

In the 1970s and 1980s, it was hotly debated 
whether the middle Pliocene hominin fos- 
sils found in Ethiopia and Tanzania represent 
more than one species, but a broad consensus 
emerged that they could all be attributed to the 
single variable species A. afarensis. Further
fossil finds rekindled the debate about taxonomic
diversity and led to the naming of two
new species: Australopithecus bahrelghazali in
1996, based on a 3.6-Myr-old partial lower jaw
from Chad, and Kenyanthropus platyops in
2001, based on cranial fossils 3.5-3.3 Myr old
from Lomekwi, Kenya. The notion of multiple,
contemporary middle Pliocene species was
highlighted more recently by the discovery at
Burtele, Ethiopia, of 3.4-Myr-old foot bones that
are too primitive to belong to A. afarensis, and
by the revised dating of a South African Australopithecus
skeleton to 3.7 Myr ago (Fig. 1).

It is against this background that Haile- 
Selassie and colleagues describe A. deyiremeda, 
which is represented by fossils between 3.5 Myr 
and 3.3 Myr old discovered at Woranso-Mille in 
Ethiopia. The location is close to sites of similar 
age that have produced abundant A. afarensis 
specimens, suggesting that the two species lived
in close proximity in place as well as in time. 

The A. deyiremeda fossils include a partial 
maxilla (upper jaw bone) and two mandibles 
(lower jaw bones). The maxilla shows that 
the cheek bones were situated more forward 
than in A. afarensis, and the mandibles have 
a thick body because the ramus (the part 
connecting with the cranium) ascends from 
a more-forward position. The cheek teeth 
are notably small, except for the lower third 
molars. Such comparisons can be made with 
confidence because A. afarensis is known from 
one of the largest samples in the hominin fos- 
sil record, providing a good representation of 
morphological variation.

But how does A. deyiremeda compare with
the other middle Pliocene hominin species?
As noted by Haile-Selassie and colleagues, it
shares several diagnostic features with K. platyops,
including forward cheek bones, small first
and second molars, and three-rooted upper
premolars. However, the two species differ in
the front part of the maxilla, which is flat and
non-projecting in K. platyops but curved and
protruding in A. deyiremeda and A. afarensis.
Similarly, the A. deyiremeda mandibles share
a forward position of the ramus with two possible
K. platyops specimens from Lomekwi, but
they are more like A. afarensis in the midplane.
In broad terms, A. deyiremeda can be seen
as sharing newly evolved morphology with
K. platyops in the cheek areas of the jaws, while
showing a more primitive, A. afarensis-like
shape towards the front. Whether the retention
of this primitive morphology in A. deyiremeda
does indeed warrant species-level distinction
from K. platyops must be confirmed by further
study and fossil finds. Comparison with
A. bahrelghazali is difficult because this species
is known only from the front part of a mandible
and an isolated premolar. It seems that
one or more additional specimens have been
recovered but these remain undescribed.

Figure 1 | Temporal distribution of known hominin species. Bars show when evidence of various
hominin species occurs in the fossil record between 4 million and 2 million years (Myr) ago. The
long-standing interpretation of hominin diversity was that Australopithecus afarensis and its probable ancestor
A. anamensis formed a single lineage during the middle Pliocene (blue bars), and that this was followed
by a radiation of Australopithecus, Homo and Paranthropus species after 3 Myr ago (green bars). However,
more-recent fossil evidence indicates taxonomic diversity during the middle Pliocene (red bars),
including Kenyanthropus platyops, multiple species of Australopithecus, and an unassigned foot fossil
from Burtele, Ethiopia. Haile-Selassie and colleagues’ announcement of the species Australopithecus
deyiremeda adds to this early diversity. The earliest archaeological evidence for the use of stone tools,
from the Lomekwi and Dikika sites, is also indicated.

Finding such taxonomic diversity raises the 
question of how multiple species could have 
coexisted over a long period in a stable eco- 
system, particularly when they live in close 
geographic proximity, as seems to be the case 
with A. deyiremeda and A. afarensis. Niche 
partitioning, involving diverse dietary prefer- 
ences, foraging strategies, habitat selection and 
population movements, will probably be the 
key factor. However, establishing a concrete 
link between such characteristics and the mor- 
phological differences that distinguish species 
is often difficult, not least because the mor- 
phology may be affected by random genetic 
drift as much as by selection. A. deyiremeda, 
A. afarensis and K. platyops differ in their jaws 
in ways that reflect the position of the chew- 
ing muscles and the size of the tooth crowns. 
These differences provide an opportunity to 
investigate whether feeding behaviour and diet 
played a part, by modelling the biomechanics 
of chewing and assessing the dental wear and 
stable isotopes present in the fossils, both of 
which can give an indication of the types of 
food eaten by the individual. 

Any discussion of niche partitioning and 
foraging behaviour among middle Pliocene 
hominins should consider the discovery of 
3.4-Myr-old cut-marked bones at Dikika in
Ethiopia, and of 3.3-Myr-old stone tools at
Lomekwi. Differences in the use of stone
tools may reflect species-specific behavioural
niches, so it would be informative if we could
identify the Lomekwi toolmaker. The tools 
were found in close geographical and tempo- 
ral proximity to where a key K. platyops speci- 
men (the paratype) was discovered, making 
this species the default candidate. 

However, a further complication now 
emerges: the K. platyops paratype is a maxil- 
lary fragment that was associated with the 
holotype (main defining specimen) on the 
basis of three shared features: forward cheek 
bones, three-rooted premolars, and a small 
first-molar crown. But these three features
are now also found in A. deyiremeda, and 
the distinctive front part of the maxilla that 
sets K. platyops apart is not preserved in the 
paratype. Hence, it cannot be excluded that it 
was A. deyiremeda that made an appearance 
at the tool site. Regardless, associating stone 
tools with a specific species as the toolmaker is 
notoriously difficult, and the increasingly rich 
fossil record of the middle Pliocene provides 
plenty of opportunity for lively debate.
When the phrase ‘artificial intelligence’ was
coined in 1956 by computer scientists,

expectations for the development of 
machines with human-like intelligent reasoning and 
behaviour were high, but the field was in for a long wait. 
In the decades to come, the term referred largely to 
popular culture. Worse, the relative failure of algorithms 
that attempted to mimic human reasoning at the higher, 
symbolic level gave the field a bad name and led to a
long-term freeze in funding. 

In the meantime, conventional, not-so-intelligent
computers became faster, more powerful and consumer- 
friendly thanks to purposeful investments by an 
ambitious electronics industry. 

However, there is not much room left for computer 
chips to improve further, owing to physical limitations. 
In the past decade, the world has also seen a massive 
explosion of data, and retrieving meaningful 
information from this deluge will soon be impossible 
with conventional computers. Fortunately, work on 
artificial neural networks has steadily continued in 
the background, and the field has recently made big 
conceptual breakthroughs. Together with the availability 
of powerful computer processors and large amounts of 
data for the algorithms to train on, the field of artificial 
intelligence has made a comeback, demonstrating 
machine-learning applications such as those that 
process visual and linguistic information in a human- 
like manner. 

Another route to machine intelligence is robotics, 
in which the presence of an artificial being in the 
physical world as well as sensory input is essential to its 
intelligent behaviour. This area, profiting from advances 
in chip technology, computer algorithms and smart 
materials, is making big strides towards the creation of 
robots that can safely assist humans in a range of tasks. 

In this Insight, we have collected some of the most 
exciting developments in machine learning and 
robotics. Expectations are again high, but, as the 
following Reviews demonstrate, there are several 
exciting avenues now open to further research. With the 
right safeguards in place, these opportunities could be 
essential to addressing the challenges of a complex world 
in the twenty-first century. 


Tanguy Chouard & Liesbeth Venema 
Senior Editors 
Deep learning


Yann LeCun, Yoshua Bengio & Geoffrey Hinton


Deep learning allows computational models that are composed of multiple processing layers to learn representations of 
data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech rec- 
ognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep 
learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine 
should change its internal parameters that are used to compute the representation in each layer from the representation in 
the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and 
audio, whereas recurrent nets have shone light on sequential data such as text and speech. 


Machine-learning technology powers many aspects of modern
society: from web searches to content filtering on social networks
to recommendations on e-commerce websites, and
it is increasingly present in consumer products such as cameras and 
smartphones. Machine-learning systems are used to identify objects 
in images, transcribe speech into text, match news items, posts or 
products with users’ interests, and select relevant results of search. 
Increasingly, these applications make use of a class of techniques called 
deep learning. 

Conventional machine-learning techniques were limited in their 
ability to process natural data in their raw form. For decades, con- 
structing a pattern-recognition or machine-learning system required 
careful engineering and considerable domain expertise to design a fea- 
ture extractor that transformed the raw data (such as the pixel values 
of an image) into a suitable internal representation or feature vector 
from which the learning subsystem, often a classifier, could detect or 
classify patterns in the input. 

Representation learning is a set of methods that allows a machine to 
be fed with raw data and to automatically discover the representations 
needed for detection or classification. Deep-learning methods are 
representation-learning methods with multiple levels of representa- 
tion, obtained by composing simple but non-linear modules that each 
transform the representation at one level (starting with the raw input) 
into a representation at a higher, slightly more abstract level. With the 
composition of enough such transformations, very complex functions 
can be learned. For classification tasks, higher layers of representation 
amplify aspects of the input that are important for discrimination and 
suppress irrelevant variations. An image, for example, comes in the 
form of an array of pixel values, and the learned features in the first 
layer of representation typically represent the presence or absence of 
edges at particular orientations and locations in the image. The second 
layer typically detects motifs by spotting particular arrangements of 
edges, regardless of small variations in the edge positions. The third 
layer may assemble motifs into larger combinations that correspond 
to parts of familiar objects, and subsequent layers would detect objects 
as combinations of these parts. The key aspect of deep learning is that 
these layers of features are not designed by human engineers: they 
are learned from data using a general-purpose learning procedure. 

Deep learning is making major advances in solving problems that 
have resisted the best attempts of the artificial intelligence commu- 
nity for many years. It has turned out to be very good at discovering 


intricate structures in high-dimensional data and is therefore applica- 
ble to many domains of science, business and government. In addition 
to beating records in image recognition and speech recognition, it
has beaten other machine-learning techniques at predicting the
activity of potential drug molecules, analysing particle accelerator data,
reconstructing brain circuits, and predicting the effects of mutations
in non-coding DNA on gene expression and disease. Perhaps more
surprisingly, deep learning has produced extremely promising results
for various tasks in natural language understanding, particularly
topic classification, sentiment analysis, question answering and
language translation.

We think that deep learning will have many more successes in the 
near future because it requires very little engineering by hand, so it 
can easily take advantage of increases in the amount of available com- 
putation and data. New learning algorithms and architectures that are 
currently being developed for deep neural networks will only acceler- 
ate this progress. 


Supervised learning 

The most common form of machine learning, deep or not, is super- 
vised learning. Imagine that we want to build a system that can classify 
images as containing, say, a house, a car, a person or a pet. We first 
collect a large data set of images of houses, cars, people and pets, each 
labelled with its category. During training, the machine is shown an 
image and produces an output in the form of a vector of scores, one 
for each category. We want the desired category to have the highest 
score of all categories, but this is unlikely to happen before training. 
We compute an objective function that measures the error (or dis- 
tance) between the output scores and the desired pattern of scores. The 
machine then modifies its internal adjustable parameters to reduce 
this error. These adjustable parameters, often called weights, are real 
numbers that can be seen as ‘knobs’ that define the input-output
function of the machine. In a typical deep-learning system, there may be
hundreds of millions of these adjustable weights, and hundreds of 
millions of labelled examples with which to train the machine. 

To properly adjust the weight vector, the learning algorithm com- 
putes a gradient vector that, for each weight, indicates by what amount 
the error would increase or decrease if the weight were increased by a
tiny amount. The weight vector is then adjusted in the opposite direc- 
tion to the gradient vector. 
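
As a rough illustration of this update rule, the sketch below estimates the
gradient of a small, made-up objective by nudging each weight by a tiny
amount, then moves the weights in the opposite direction. The objective, the
learning rate and the function names are assumptions chosen for the example;
real systems compute gradients analytically rather than by finite differences.

```python
def objective(w):
    # Hypothetical error surface with its minimum at w = (3, -1).
    return (w[0] - 3.0) ** 2 + (w[1] + 1.0) ** 2


def numerical_gradient(f, w, eps=1e-6):
    # For each weight, estimate how much the error changes when that
    # weight is increased by a tiny amount (central difference).
    grad = []
    for i in range(len(w)):
        up = list(w)
        up[i] += eps
        down = list(w)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad


def gradient_step(f, w, learning_rate=0.1):
    grad = numerical_gradient(f, w)
    # Adjust each weight in the direction opposite to its gradient component.
    return [wi - learning_rate * gi for wi, gi in zip(w, grad)]


w = [0.0, 0.0]
for _ in range(100):
    w = gradient_step(objective, w)
```

After the loop, `w` should sit very close to the minimum of this toy objective.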

The objective function, averaged over all the training examples, can
be seen as a kind of hilly landscape in the high-dimensional space of
weight values. The negative gradient vector indicates the direction
of steepest descent in this landscape, taking it closer to a minimum,
where the output error is low on average.

¹Facebook AI Research, 770 Broadway, New York, New York 10003, USA.
²New York University, 715 Broadway, New York, New York 10003, USA.
³Department of Computer Science and Operations Research, Université de
Montréal, Pavillon André-Aisenstadt, PO Box 6128 Centre-Ville STN Montréal,
Quebec H3C 3J7, Canada. ⁴Google, 1600 Amphitheatre Parkway, Mountain View,
California 94043, USA. ⁵Department of Computer Science, University of
Toronto, 6 King’s College Road, Toronto, Ontario M5S 3G4, Canada.

In practice, most practitioners use a procedure called stochastic 
gradient descent (SGD). This consists of showing the input vector 
for a few examples, computing the outputs and the errors, computing 
the average gradient for those examples, and adjusting the weights 
accordingly. The process is repeated for many small sets of examples 
from the training set until the average of the objective function stops 
decreasing. It is called stochastic because each small set of examples 
gives a noisy estimate of the average gradient over all examples. This 
simple procedure usually finds a good set of weights surprisingly 
quickly when compared with far more elaborate optimization
techniques. After training, the performance of the system is measured
on a different set of examples called a test set. This serves to test the 
generalization ability of the machine — its ability to produce sensible 
answers on new inputs that it has never seen during training. 
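
The procedure can be sketched on a deliberately tiny problem. The example
below is only an illustration under stated assumptions: the task (fitting a
line to noisy points), the batch size, the learning rate and all function
names are hypothetical, and training runs for a fixed number of steps rather
than until the objective stops decreasing.

```python
import random


def make_data(n, rng):
    # Hypothetical task: y = 2x + 1 plus a little noise.
    data = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        data.append((x, 2.0 * x + 1.0 + rng.gauss(0.0, 0.05)))
    return data


def mean_squared_error(w, b, examples):
    return sum((w * x + b - y) ** 2 for x, y in examples) / len(examples)


def sgd(train, batch_size=8, learning_rate=0.1, steps=500, seed=0):
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    for _ in range(steps):
        # A few examples give a noisy estimate of the full average gradient.
        batch = rng.sample(train, batch_size)
        # Average gradient of 0.5 * (prediction - target)^2 over the batch.
        grad_w = sum((w * x + b - y) * x for x, y in batch) / batch_size
        grad_b = sum((w * x + b - y) for x, y in batch) / batch_size
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


rng = random.Random(1)
train, test = make_data(200, rng), make_data(50, rng)
w, b = sgd(train)
# Generalization is measured on examples never used for training.
test_error = mean_squared_error(w, b, test)
```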
Figure 1 | Multilayer neural networks and backpropagation. a, A multilayer
neural network (shown by the connected dots) can distort the input
space to make the classes of data (examples of which are on the red and
blue lines) linearly separable. Note how a regular grid (shown on the left)
in input space is also transformed (shown in the middle panel) by hidden
units. This is an illustrative example with only two input units, two hidden
units and one output unit, but the networks used for object recognition
or natural language processing contain tens or hundreds of thousands of
units. Reproduced with permission from C. Olah (http://colah.github.io/).
b, The chain rule of derivatives tells us how two small effects (that of a small
change of x on y, and that of y on z) are composed. A small change $\Delta x$ in
x gets transformed first into a small change $\Delta y$ in y by getting multiplied
by $\partial y/\partial x$ (that is, the definition of partial derivative). Similarly,
the change $\Delta y$ creates a change $\Delta z$ in z. Substituting one equation into
the other gives the chain rule of derivatives: how $\Delta x$ gets turned into
$\Delta z$ through multiplication by the product of $\partial y/\partial x$ and
$\partial z/\partial y$. It also works when x, y and z are vectors (and the
derivatives are Jacobian matrices). c, The equations used for computing the
forward pass in a neural net with two hidden layers and one output layer,
each constituting a module through which one can backpropagate gradients.
At each layer, we first compute the total input z to each unit, which is a
weighted sum of the outputs of the units in the layer below. Then a non-linear
function f(.) is applied to z to get the output of the unit. For simplicity, we
have omitted bias terms. The non-linear functions used in neural networks
include the rectified linear unit (ReLU) f(z) = max(0, z), commonly used in
recent years, as well as the more conventional sigmoids, such as the hyperbolic
tangent, f(z) = (exp(z) − exp(−z))/(exp(z) + exp(−z)), and the logistic function,
f(z) = 1/(1 + exp(−z)). d, The equations used for computing the backward
pass. At each hidden layer we compute the error derivative with respect to
the output of each unit, which is a weighted sum of the error derivatives
with respect to the total inputs to the units in the layer above. We then
convert the error derivative with respect to the output into the error
derivative with respect to the input by multiplying it by the gradient of f(z).
At the output layer, the error derivative with respect to the output of a unit
is computed by differentiating the cost function. This gives $y_l - t_l$ if the
cost function for unit l is $0.5(y_l - t_l)^2$, where $t_l$ is the target value.
Once $\partial E/\partial z_k$ is known, the error derivative for the weight
$w_{jk}$ on the connection from unit j in the layer below is just
$y_j \, \partial E/\partial z_k$.

Many of the current practical applications of machine learning use
linear classifiers on top of hand-engineered features. A two-class linear
classifier computes a weighted sum of the feature vector components.
If the weighted sum is above a threshold, the input is classified as
belonging to a particular category.

Since the 1960s we have known that linear classifiers can only carve
their input space into very simple regions, namely half-spaces
separated by a hyperplane. But problems such as image and speech
recognition require the input-output function to be insensitive to irrelevant
variations of the input, such as variations in position, orientation or
illumination of an object, or variations in the pitch or accent of speech,
while being very sensitive to particular minute variations (for example,
the difference between a white wolf and a breed of wolf-like white
dog called a Samoyed). At the pixel level, images of two Samoyeds in
different poses and in different environments may be very different
from each other, whereas two images of a Samoyed and a wolf in the
same position and on similar backgrounds may be very similar to each
other. A linear classifier, or any other ‘shallow’ classifier operating on
raw pixels, could not possibly distinguish the latter two, while putting
the former two in the same category. This is why shallow classifiers
require a good feature extractor that solves the selectivity-invariance
dilemma: one that produces representations that are selective to
the aspects of the image that are important for discrimination, but
that are invariant to irrelevant aspects such as the pose of the animal.
To make classifiers more powerful, one can use generic non-linear
features, as with kernel methods, but generic features such as those
arising with the Gaussian kernel do not allow the learner to generalize
well far from the training examples. The conventional option is
to hand design good feature extractors, which requires a considerable
amount of engineering skill and domain expertise. But this can
all be avoided if good features can be learned automatically using a
general-purpose learning procedure. This is the key advantage of
deep learning.

A deep-learning architecture is a multilayer stack of simple modules,
all (or most) of which are subject to learning, and many of which
compute non-linear input-output mappings. Each module in the
stack transforms its input to increase both the selectivity and the
invariance of the representation. With multiple non-linear layers, say
a depth of 5 to 20, a system can implement extremely intricate
functions of its inputs that are simultaneously sensitive to minute details
(distinguishing Samoyeds from white wolves) and insensitive to
large irrelevant variations such as the background, pose, lighting and
surrounding objects.

Figure 2 | Inside a convolutional network. The outputs (not the filters)
of each layer (horizontally) of a typical convolutional network architecture
applied to the image of a Samoyed dog (bottom left; and RGB (red, green,
blue) inputs, bottom right). Each rectangular image is a feature map
corresponding to the output for one of the learned features, detected at each
of the image positions. Information flows bottom up, with lower-level features
acting as oriented edge detectors, and a score is computed for each image class
in output. ReLU, rectified linear unit. Labels in the figure: Samoyed (16);
Papillon (5.7); Pomeranian (2.7); Arctic fox (1.0); Eskimo dog (0.6);
white wolf (0.4); Siberian husky (0.4).


Backpropagation to train multilayer architectures
From the earliest days of pattern recognition, the aim of researchers
has been to replace hand-engineered features with trainable
multilayer networks, but despite its simplicity, the solution was not
widely understood until the mid 1980s. As it turns out, multilayer
architectures can be trained by simple stochastic gradient descent.
As long as the modules are relatively smooth functions of their inputs
and of their internal weights, one can compute gradients using the
backpropagation procedure. The idea that this could be done, and
that it worked, was discovered independently by several different
groups during the 1970s and 1980s.

The backpropagation procedure to compute the gradient of an
objective function with respect to the weights of a multilayer stack
of modules is nothing more than a practical application of the chain
rule for derivatives. The key insight is that the derivative (or gradient)
of the objective with respect to the input of a module can be
computed by working backwards from the gradient with respect to
the output of that module (or the input of the subsequent module)
(Fig. 1). The backpropagation equation can be applied repeatedly to
propagate gradients through all modules, starting from the output
at the top (where the network produces its prediction) all the way to
the bottom (where the external input is fed). Once these gradients
have been computed, it is straightforward to compute the gradients
with respect to the weights of each module.
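
To make the backward flow of gradients concrete, the following sketch applies
the chain rule by hand to a two-module network with one unit per layer, and
compares the result with a finite-difference estimate. The network shape, the
choice of tanh for the hidden unit, the squared-error cost and all numerical
values are assumptions made for the illustration.

```python
import math


def forward(w1, w2, x):
    z1 = w1 * x          # total input to the hidden unit
    y1 = math.tanh(z1)   # output of the hidden unit
    y2 = w2 * y1         # linear output unit
    return z1, y1, y2


def error(w1, w2, x, t):
    return 0.5 * (forward(w1, w2, x)[2] - t) ** 2


def backprop(w1, w2, x, t):
    z1, y1, y2 = forward(w1, w2, x)
    dE_dy2 = y2 - t                     # derivative of 0.5 * (y2 - t)^2
    dE_dw2 = dE_dy2 * y1                # weight gradient of the top module
    dE_dy1 = dE_dy2 * w2                # gradient passed back to the hidden output
    dE_dz1 = dE_dy1 * (1.0 - y1 ** 2)   # multiply by the gradient of tanh
    dE_dw1 = dE_dz1 * x                 # weight gradient of the bottom module
    return dE_dw1, dE_dw2


w1, w2, x, t, eps = 0.5, -1.5, 0.8, 1.0, 1e-6
g1, g2 = backprop(w1, w2, x, t)
# Finite-difference estimates of the same two derivatives.
n1 = (error(w1 + eps, w2, x, t) - error(w1 - eps, w2, x, t)) / (2 * eps)
n2 = (error(w1, w2 + eps, x, t) - error(w1, w2 - eps, x, t)) / (2 * eps)
```

The two estimates should agree to within numerical precision, which is a
common sanity check for a hand-written backward pass.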

Many applications of deep learning use feedforward neural net- 
work architectures (Fig. 1), which learn to map a fixed-size input 
(for example, an image) to a fixed-size output (for example, a prob- 
ability for each of several categories). To go from one layer to the 
next, a set of units compute a weighted sum of their inputs from the 
previous layer and pass the result through a non-linear function. At 
present, the most popular non-linear function is the rectified linear 
unit (ReLU), which is simply the half-wave rectifier f(z) = max(z, 0). 
In past decades, neural nets used smoother non-linearities, such as 
tanh(z) or 1/(1+exp(—z)), but the ReLU typically learns much faster 
in networks with many layers, allowing training of a deep supervised 
network without unsupervised pre-training. Units that are not in
the input or output layer are conventionally called hidden units. The 
hidden layers can be seen as distorting the input in a non-linear way 
so that categories become linearly separable by the last layer (Fig. 1). 
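
A minimal sketch of such a forward pass is given below. The layer sizes and
weight values are invented for the example, bias terms are omitted, and the
output layer is left linear so that it simply produces one score per category.

```python
def relu(z):
    # Half-wave rectifier: f(z) = max(z, 0).
    return max(z, 0.0)


def identity(z):
    return z


def layer(inputs, weights, nonlinearity):
    # Each row of `weights` holds the incoming weights of one unit, which
    # computes a weighted sum of its inputs and applies the non-linearity.
    return [nonlinearity(sum(w * x for w, x in zip(row, inputs)))
            for row in weights]


# Hypothetical weights: 2 inputs -> 3 hidden units -> 2 output scores.
hidden_weights = [[1.0, -1.0], [0.5, 0.5], [-1.0, 2.0]]
output_weights = [[1.0, 0.0, -1.0], [0.0, 2.0, 1.0]]

x = [1.0, 2.0]
hidden = layer(x, hidden_weights, relu)           # [0.0, 1.5, 3.0]
scores = layer(hidden, output_weights, identity)  # [-3.0, 6.0]
```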

In the late 1990s, neural nets and backpropagation were largely 
forsaken by the machine-learning community and ignored by the 
computer-vision and speech-recognition communities. It was widely 
thought that learning useful, multistage, feature extractors with lit- 
tle prior knowledge was infeasible. In particular, it was commonly 
thought that simple gradient descent would get trapped in poor local 
minima — weight configurations for which no small change would 
reduce the average error. 

In practice, poor local minima are rarely a problem with large net- 
works. Regardless of the initial conditions, the system nearly always 
reaches solutions of very similar quality. Recent theoretical and 
empirical results strongly suggest that local minima are not a serious
issue in general. Instead, the landscape is packed with a combinato- 
rially large number of saddle points where the gradient is zero, and 
the surface curves up in most dimensions and curves down in the 
remainder. The analysis seems to show that saddle points with
only a few downward curving directions are present in very large 
numbers, but almost all of them have very similar values of the objec- 
tive function. Hence, it does not much matter which of these saddle 
points the algorithm gets stuck at. 

Interest in deep feedforward networks was revived around 2006 
(refs 31-34) by a group of researchers brought together by the Cana- 
dian Institute for Advanced Research (CIFAR). The researchers intro- 
duced unsupervised learning procedures that could create layers of 
feature detectors without requiring labelled data. The objective in 
learning each layer of feature detectors was to be able to reconstruct 
or model the activities of feature detectors (or raw inputs) in the layer 
below. By ‘pre-training’ several layers of progressively more complex 
feature detectors using this reconstruction objective, the weights of a
deep network could be initialized to sensible values. A final layer of 
output units could then be added to the top of the network and the 
whole deep system could be fine-tuned using standard
backpropagation. This worked remarkably well for recognizing handwritten
digits or for detecting pedestrians, especially when the amount of
labelled data was very limited.

The first major application of this pre-training approach was in 
speech recognition, and it was made possible by the advent of fast 
graphics processing units (GPUs) that were convenient to program
and allowed researchers to train networks 10 or 20 times faster. In 
2009, the approach was used to map short temporal windows of coef- 
ficients extracted from a sound wave to a set of probabilities for the 
various fragments of speech that might be represented by the frame 
in the centre of the window. It achieved record-breaking results on a 
standard speech recognition benchmark that used a small
vocabulary and was quickly developed to give record-breaking results on
a large vocabulary task. By 2012, versions of the deep net from 2009
were being developed by many of the major speech groups and were
already being deployed in Android phones. For smaller data sets,
unsupervised pre-training helps to prevent overfitting, leading to
significantly better generalization when the number of labelled exam- 
ples is small, or in a transfer setting where we have lots of examples 
for some ‘source’ tasks but very few for some ‘target’ tasks. Once deep 
learning had been rehabilitated, it turned out that the pre-training 
stage was only needed for small data sets. 

There was, however, one particular type of deep, feedforward net- 
work that was much easier to train and generalized much better than 
networks with full connectivity between adjacent layers. This was 
the convolutional neural network (ConvNet). It achieved many
practical successes during the period when neural networks were out 
of favour and it has recently been widely adopted by the computer- 
vision community. 


Convolutional neural networks 

ConvNets are designed to process data that come in the form of 
multiple arrays, for example a colour image composed of three 2D 
arrays containing pixel intensities in the three colour channels. Many 
data modalities are in the form of multiple arrays: 1D for signals and 
sequences, including language; 2D for images or audio spectrograms; 
and 3D for video or volumetric images. There are four key ideas 
behind ConvNets that take advantage of the properties of natural 
signals: local connections, shared weights, pooling and the use of 
many layers. 

The architecture of a typical ConvNet (Fig. 2) is structured as a 
series of stages. The first few stages are composed of two types of 
layers: convolutional layers and pooling layers. Units in a convolu- 
tional layer are organized in feature maps, within which each unit 
is connected to local patches in the feature maps of the previous 
layer through a set of weights called a filter bank. The result of this 
local weighted sum is then passed through a non-linearity such as a 
ReLU. All units in a feature map share the same filter bank. Differ- 
ent feature maps in a layer use different filter banks. The reason for 
this architecture is twofold. First, in array data such as images, local
groups of values are often highly correlated, forming distinctive local 
motifs that are easily detected. Second, the local statistics of images 
and other signals are invariant to location. In other words, if a motif 
can appear in one part of the image, it could appear anywhere, hence 
the idea of units at different locations sharing the same weights and 
detecting the same pattern in different parts of the array. Mathemati- 
cally, the filtering operation performed by a feature map is a discrete 
convolution, hence the name. 
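
The idea of shared weights can be sketched in one dimension. In the example
below, the filter values and the input signal are made up, and, as is common
in ConvNet code, the filter is not flipped, so strictly speaking the operation
is a cross-correlation followed by a ReLU.

```python
def feature_map(signal, filter_weights):
    # Every output unit applies the same weights to a different local patch,
    # then passes the weighted sum through a ReLU.
    k = len(filter_weights)
    return [max(0.0, sum(w * v for w, v in zip(filter_weights, signal[i:i + k])))
            for i in range(len(signal) - k + 1)]


edge_filter = [-1.0, 1.0]  # hypothetical filter that responds to a rising step
signal = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0]
response = feature_map(signal, edge_filter)
# response == [0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0]
```

The same motif (a rising step) is detected at both places where it occurs,
without any location-specific weights.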

Although the role of the convolutional layer is to detect local con- 
junctions of features from the previous layer, the role of the pooling 
layer is to merge semantically similar features into one. Because the 
relative positions of the features forming a motif can vary somewhat, 
reliably detecting the motif can be done by coarse-graining the posi- 
tion of each feature. A typical pooling unit computes the maximum 
of a local patch of units in one feature map (or in a few feature maps).
Neighbouring pooling units take input from patches that are shifted 
by more than one row or column, thereby reducing the dimension of 
the representation and creating an invariance to small shifts and dis- 
tortions. Two or three stages of convolution, non-linearity and pool- 
ing are stacked, followed by more convolutional and fully-connected 
layers. Backpropagating gradients through a ConvNet is as simple as 
through a regular deep network, allowing all the weights in all the 
filter banks to be trained. 
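
A small sketch of max pooling follows. The window size, the stride and the two
input feature maps are assumptions for the example; it shows only the simplest
case, in which a feature moves by one position inside the same pooling window.

```python
def max_pool(feature_map, size=2, stride=2):
    # Each pooling unit takes the maximum over a local patch; neighbouring
    # units read patches shifted by `stride` positions.
    return [max(feature_map[i:i + size])
            for i in range(0, len(feature_map) - size + 1, stride)]


a = [0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
b = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]  # same feature shifted by one position

pooled_a = max_pool(a)  # [1.0, 0.0, 0.0]
pooled_b = max_pool(b)  # [1.0, 0.0, 0.0]
```

The pooled representation is half the length of the input and is identical
for the two shifted inputs. A shift that crosses a window boundary would still
change the output, so the invariance holds only for small shifts.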

Deep neural networks exploit the property that many natural sig- 
nals are compositional hierarchies, in which higher-level features 
are obtained by composing lower-level ones. In images, local combi- 
nations of edges form motifs, motifs assemble into parts, and parts 
form objects. Similar hierarchies exist in speech and text from sounds 
to phones, phonemes, syllables, words and sentences. The pooling 
allows representations to vary very little when elements in the previ- 
ous layer vary in position and appearance. 

The convolutional and pooling layers in ConvNets are directly 
inspired by the classic notions of simple cells and complex cells in 
visual neuroscience, and the overall architecture is reminiscent of
the LGN-V1-V2-V4-IT hierarchy in the visual cortex ventral
pathway. When ConvNet models and monkeys are shown the same
picture, the activations of high-level units in the ConvNet explain half
of the variance of random sets of 160 neurons in the monkey’s
inferotemporal cortex. ConvNets have their roots in the neocognitron,
the architecture of which was somewhat similar, but did not have an
end-to-end supervised-learning algorithm such as backpropagation.
A primitive 1D ConvNet called a time-delay neural net was used for
the recognition of phonemes and simple words.

There have been numerous applications of convolutional
networks going back to the early 1990s, starting with time-delay
neural networks for speech recognition and document reading. The
document reading system used a ConvNet trained jointly with a
probabilistic model that implemented language constraints. By the
late 1990s this system was reading over 10% of all the cheques in the
United States. A number of ConvNet-based optical character
recognition and handwriting recognition systems were later deployed by
Microsoft. ConvNets were also experimented with in the early 1990s
for object detection in natural images, including faces and hands,
and for face recognition.


Image understanding with deep convolutional networks 
Since the early 2000s, ConvNets have been applied with great success to
the detection, segmentation and recognition of objects and regions in
images. These were all tasks in which labelled data was relatively
abundant, such as traffic sign recognition, the segmentation of biological
images particularly for connectomics, and the detection of faces,
text, pedestrians and human bodies in natural images. A major
recent practical success of ConvNets is face recognition.

Importantly, images can be labelled at the pixel level, which will have
applications in technology, including autonomous mobile robots and
self-driving cars. Companies such as Mobileye and NVIDIA are
using such ConvNet-based methods in their upcoming vision
systems for cars. Other applications gaining importance involve natural
language understanding and speech recognition.

Despite these successes, ConvNets were largely forsaken by the
mainstream computer-vision and machine-learning communities
until the ImageNet competition in 2012. When deep convolutional
networks were applied to a data set of about a million images from
the web that contained 1,000 different classes, they achieved
spectacular results, almost halving the error rates of the best
competing approaches. This success came from the efficient use of GPUs,
ReLUs, a new regularization technique called dropout, and
techniques to generate more training examples by deforming the existing
ones. This success has brought about a revolution in computer vision;
ConvNets are now the dominant approach for almost all recognition
and detection tasks and approach human performance on
some tasks. A recent stunning demonstration combines ConvNets
and recurrent net modules for the generation of image captions
(Fig. 3).

Figure 3 | From image to text. Captions generated by a recurrent neural
network (RNN) taking, as extra input, the representation extracted by a deep
convolution neural network (CNN) from a test image, with the RNN trained to
‘translate’ high-level representations of images into captions (top). Reproduced
with permission from ref. 102. When the RNN is given the ability to focus its
attention on a different location in the input image (middle and bottom; the
lighter patches were given more attention) as it generates each word (bold), we
found that it exploits this to achieve better ‘translation’ of images into captions.

Recent ConvNet architectures have 10 to 20 layers of ReLUs,
hundreds of millions of weights, and billions of connections between
units. Whereas training such large networks could have taken weeks
only two years ago, progress in hardware, software and algorithm
parallelization has reduced training times to a few hours.

The performance of ConvNet-based vision systems has caused
most major technology companies, including Google, Facebook,
Microsoft, IBM, Yahoo!, Twitter and Adobe, as well as a quickly
growing number of start-ups, to initiate research and development
projects and to deploy ConvNet-based image understanding products
and services.

ConvNets are easily amenable to efficient hardware
implementations in chips or field-programmable gate arrays. A number
of companies such as NVIDIA, Mobileye, Intel, Qualcomm and 
Samsung are developing ConvNet chips to enable real-time vision 
applications in smartphones, cameras, robots and self-driving cars. 


Distributed representations and language processing 
Deep-learning theory shows that deep nets have two different
exponential advantages over classic learning algorithms that do not use
distributed representations. Both of these advantages arise from the
power of composition and depend on the underlying data-generating
distribution having an appropriate componential structure. First,
learning distributed representations enables generalization to new
combinations of the values of learned features beyond those seen
during training (for example, $2^n$ combinations are possible with n
binary features). Second, composing layers of representation in
a deep net brings the potential for another exponential advantage
(exponential in the depth).

The hidden layers of a multilayer neural network learn to repre- 
sent the network’s inputs in a way that makes it easy to predict the 
target outputs. This is nicely demonstrated by training a multilayer 
neural network to predict the next word in a sequence from a local 
context of earlier words. Each word in the context is presented to
the network as a one-of-N vector, that is, one component has a value 
of 1 and the rest are 0. In the first layer, each word creates a different 
pattern of activations, or word vectors (Fig. 4). In a language model, 
the other layers of the network learn to convert the input word vec- 
tors into an output word vector for the predicted next word, which 
can be used to predict the probability for any word in the vocabulary 
to appear as the next word. The network learns word vectors that 
contain many active components each of which can be interpreted 
as a separate feature of the word, as was first demonstrated in the
context of learning distributed representations for symbols. These 
semantic features were not explicitly present in the input. They were 
discovered by the learning procedure as a good way of factorizing 
the structured relationships between the input and output symbols 
into multiple ‘micro-rules’. Learning word vectors turned out to also
work very well when the word sequences come from a large corpus
of real text and the individual micro-rules are unreliable. When
trained to predict the next word in a news story, for example, the 
learned word vectors for Tuesday and Wednesday are very similar, as 
are the word vectors for Sweden and Norway. Such representations 
are called distributed representations because their elements (the 
features) are not mutually exclusive and their many configurations 
correspond to the variations seen in the observed data. These word 
vectors are composed of learned features that were not determined 
ahead of time by experts, but automatically discovered by the neural 
network. Vector representations of words learned from text are now 
very widely used in natural language applications.
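
The step from a one-of-N input to a word vector can be sketched as follows.
The three-word vocabulary and the two-component vectors are hypothetical
values set by hand purely for illustration; in a real language model they are
learned from text, not chosen by a person.

```python
import math

vocab = ["tuesday", "wednesday", "sweden"]
# Hypothetical first-layer weights: one row (word vector) per vocabulary word.
embedding = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]


def one_of_n(word):
    # One component has a value of 1 and the rest are 0.
    return [1.0 if w == word else 0.0 for w in vocab]


def word_vector(word):
    # Multiplying a one-of-N vector by the first-layer weight matrix
    # simply selects the row belonging to that word.
    x = one_of_n(word)
    return [sum(x[i] * embedding[i][j] for i in range(len(vocab)))
            for j in range(len(embedding[0]))]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


similar = cosine(word_vector("tuesday"), word_vector("wednesday"))
dissimilar = cosine(word_vector("tuesday"), word_vector("sweden"))
```

With these made-up values, the two weekday vectors are much closer to each
other than either is to the country name, which mirrors the kind of similarity
described above.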

The issue of representation lies at the heart of the debate between 
the logic-inspired and the neural-network-inspired paradigms for 
cognition. In the logic-inspired paradigm, an instance of a symbol is 
something for which the only property is that it is either identical or 
non-identical to other symbol instances. It has no internal structure 
that is relevant to its use; and to reason with symbols, they must be 
bound to the variables in judiciously chosen rules of inference. By 
contrast, neural networks just use big activity vectors, big weight 
matrices and scalar non-linearities to perform the type of fast ‘intui- 
tive’ inference that underpins effortless commonsense reasoning. 

Before the introduction of neural language models, the standard
approach to statistical modelling of language did not exploit distrib- 
uted representations: it was based on counting frequencies of occur- 
rences of short symbol sequences of length up to N (called N-grams). 
The number of possible N-grams is on the order of V^N, where V is
the vocabulary size, so taking into account a context of more than a 

Figure 4 | Visualizing the learned word vectors. On the left is an illustration
of word representations learned for modelling language, non-linearly projected
to 2D for visualization using the t-SNE algorithm. On the right is a 2D
representation of phrases learned by an English-to-French encoder-decoder
recurrent neural network. One can observe that semantically similar words

handful of words would require very large training corpora. N-grams
treat each word as an atomic unit, so they cannot generalize across 
semantically related sequences of words, whereas neural language 
models can because they associate each word with a vector of real 
valued features, and semantically related words end up close to each 
other in that vector space (Fig. 4). 
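
A small sketch makes the counting approach and its V^N growth concrete. The nine-word corpus and the function name ngram_counts are illustrative assumptions, not part of the Review.

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Count every contiguous sequence of n tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

# Hypothetical toy corpus.
tokens = "the cat sat on the mat the cat ran".split()
bigrams = ngram_counts(tokens, 2)

vocab_size = len(set(tokens))        # V
possible_bigrams = vocab_size ** 2   # V^N with N = 2
observed_bigrams = len(bigrams)      # distinct bigrams actually seen
```

Even in this tiny corpus only a fraction of the possible bigrams is observed, and because each word is an atomic symbol, a count for ('the', 'cat') says nothing about a related but unseen pair.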


Recurrent neural networks 

When backpropagation was first introduced, its most exciting use was 
for training recurrent neural networks (RNNs). For tasks that involve 
sequential inputs, such as speech and language, it is often better to 
use RNNs (Fig. 5). RNNs process an input sequence one element at a 
time, maintaining in their hidden units a ‘state vector’ that implicitly 
contains information about the history of all the past elements of 
the sequence. When we consider the outputs of the hidden units at 
different discrete time steps as if they were the outputs of different 
neurons in a deep multilayer network (Fig. 5, right), it becomes clear 
how we can apply backpropagation to train RNNs. 
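
The unfolded forward computation can be sketched as follows. This is a minimal illustration that assumes U maps inputs to hidden units, W maps the previous hidden state to the current one, V maps hidden units to outputs, and tanh is the hidden non-linearity; these roles and the example weights are assumptions, not details given in the text.

```python
import math

def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def rnn_forward(xs, U, W, V, s0):
    """Run s_t = tanh(U x_t + W s_{t-1}) and o_t = V s_t over the sequence xs."""
    s, states, outputs = s0, [], []
    for x in xs:
        pre = [a + b for a, b in zip(matvec(U, x), matvec(W, s))]
        s = [math.tanh(p) for p in pre]  # the same U and W are reused at every step
        states.append(s)
        outputs.append(matvec(V, s))
    return states, outputs

# Hypothetical weights: one input, two hidden units, one output.
U = [[0.5], [-0.3]]
W = [[0.1, 0.2], [0.0, 0.4]]
V = [[1.0, -1.0]]
states, outputs = rnn_forward([[1.0], [0.0], [0.0]], U, W, V, s0=[0.0, 0.0])
```

Only the first input is non-zero, yet the later outputs are non-zero as well, because the state vector carries information about the first element forward in time.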

RNNs are very powerful dynamic systems, but training them has 
proved to be problematic because the backpropagated gradients 
either grow or shrink at each time step, so over many time steps they 
typically explode or vanish.
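
A deliberately simplified scalar case shows the effect. The sketch assumes a linear recurrence s_t = w * s_{t-1} with a single weight w, so the gradient of s_T with respect to s_0 is w multiplied by itself T times; real RNNs involve matrices and non-linearities, but the compounding is analogous.

```python
def backprop_factor(w, steps):
    """Gradient of s_T with respect to s_0 for the recurrence s_t = w * s_{t-1}."""
    grad = 1.0
    for _ in range(steps):
        grad *= w  # chain rule: one factor of w per time step
    return grad

shrinking = backprop_factor(0.9, 100)  # factor slightly below 1: gradient vanishes
growing = backprop_factor(1.1, 100)    # factor slightly above 1: gradient explodes
```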

Thanks to advances in their architecture and ways of training
them, RNNs have been found to be very good at predicting the
next character in the text or the next word in a sequence, but they
can also be used for more complex tasks. For example, after reading 
an English sentence one word at a time, an English ‘encoder’ network 
can be trained so that the final state vector of its hidden units is a good 
representation of the thought expressed by the sentence. This thought 
vector can then be used as the initial hidden state of (or as extra input 
to) a jointly trained French ‘decoder’ network, which outputs a probability
distribution for the first word of the French translation. If a
particular first word is chosen from this distribution and provided
as input to the decoder network, it will then output a probability
distribution for the second word of the translation, and so on until a
full stop is chosen. Overall, this process generates sequences of
French words according to a probability distribution that depends on 
the English sentence. This rather naive way of performing machine 
translation has quickly become competitive with the state-of-the-art, 
and this raises serious doubts about whether understanding a sen- 
tence requires anything like the internal symbolic expressions that are 
manipulated by using inference rules. It is more compatible with the 
view that everyday reasoning involves many simultaneous analogies 

or sequences of words are mapped to nearby representations. The distributed
representations of words are obtained by using backpropagation to jointly learn
a representation for each word and a function that predicts a target quantity
such as the next word in a sequence (for language modelling) or a whole
sequence of translated words (for machine translation).

Figure 5 | A recurrent neural network and the unfolding in time of the
computation involved in its forward computation. The artificial neurons
(for example, hidden units grouped under node s with values s_t at time t) get
inputs from other neurons at previous time steps (this is represented with the
black square, representing a delay of one time step, on the left). In this way, a
recurrent neural network can map an input sequence with elements x_t into an
output sequence with elements o_t, with each o_t depending on all the previous
x_t' (for t' < t). The same parameters (matrices U, V, W) are used at each time
step. Many other architectures are possible, including a variant in which the
network can generate a sequence of outputs (for example, words), each of
which is used as inputs for the next time step. The backpropagation algorithm
(Fig. 1) can be directly applied to the computational graph of the unfolded
network on the right, to compute the derivative of a total error (for example,
the log-probability of generating the right sequence of outputs) with respect to
all the states s_t and all the parameters.


that each contribute plausibility to a conclusion.


Instead of translating the meaning of a French sentence into an 
English sentence, one can learn to ‘translate’ the meaning of an image 
into an English sentence (Fig. 3). The encoder here is a deep ConvNet
that converts the pixels into an activity vector in its last hidden
layer. The decoder is an RNN similar to the ones used for machine 
translation and neural language modelling. There has been a surge of 
interest in such systems recently (see examples mentioned in ref. 86). 

RNNs, once unfolded in time (Fig. 5), can be seen as very deep 
feedforward networks in which all the layers share the same weights. 
Although their main purpose is to learn long-term dependencies, 
theoretical and empirical evidence shows that it is difficult to learn 
to store information for very long.

To correct for that, one idea is to augment the network with an 
explicit memory. The first proposal of this kind is the long short-term 
memory (LSTM) networks that use special hidden units, the natural 
behaviour of which is to remember inputs for a long time. A special
unit called the memory cell acts like an accumulator or a gated leaky 
neuron: it has a connection to itself at the next time step that has a 
weight of one, so it copies its own real-valued state and accumulates 
the external signal, but this self-connection is multiplicatively gated 
by another unit that learns to decide when to clear the content of the 
memory. 
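
The accumulator behaviour of the memory cell can be sketched in a few lines. This is a simplification under stated assumptions: the state is a single number, and the gate values are supplied by hand, whereas in an LSTM they are produced by learned units (and further gates control what is written and read).

```python
def memory_cell(inputs, keep_gates, c0=0.0):
    """Update c_t = keep_t * c_{t-1} + x_t, with each keep_t between 0 and 1.

    The self-connection has weight one; keep_t gates it multiplicatively,
    so keep_t = 1 copies the old state and keep_t = 0 clears it.
    """
    c, trace = c0, []
    for x, keep in zip(inputs, keep_gates):
        c = keep * c + x
        trace.append(c)
    return trace

# Accumulate for three steps, then clear the memory while writing a new value.
trace = memory_cell([1.0, 0.5, 0.0, 2.0], [1.0, 1.0, 1.0, 0.0])
```

With the gate held at one, the stored value persists unchanged across a step with no input, which is the property that lets such units remember inputs for a long time.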

LSTM networks have subsequently proved to be more effective 
than conventional RNNs, especially when they have several layers for 
each time step, enabling an entire speech recognition system that
goes all the way from acoustics to the sequence of characters in the 
transcription. LSTM networks or related forms of gated units are also 
currently used for the encoder and decoder networks that perform 
so well at machine translation.

Over the past year, several authors have made different proposals to 
augment RNNs with a memory module. Proposals include the Neural 
Turing Machine in which the network is augmented by a ‘tape-like’ 
memory that the RNN can choose to read from or write to, and
memory networks, in which a regular network is augmented by a
kind of associative memory. Memory networks have yielded excellent
performance on standard question-answering benchmarks. The
memory is used to remember the story about which the network is 
later asked to answer questions. 

Beyond simple memorization, neural Turing machines and mem- 
ory networks are being used for tasks that would normally require 
reasoning and symbol manipulation. Neural Turing machines can 
be taught ‘algorithms’. Among other things, they can learn to output
a sorted list of symbols when their input consists of an unsorted
sequence in which each symbol is accompanied by a real value that
indicates its priority in the list. Memory networks can be trained
to keep track of the state of the world in a setting similar to a text
adventure game and, after reading a story, they can answer questions
that require complex inference. In one test example, the network is
shown a 15-sentence version of The Lord of the Rings and correctly
answers questions such as “where is Frodo now?”.


The future of deep learning 

Unsupervised learning had a catalytic effect in reviving interest in
deep learning, but has since been overshadowed by the successes of 
purely supervised learning. Although we have not focused on it in this 
Review, we expect unsupervised learning to become far more important 
in the longer term. Human and animal learning is largely unsupervised: 
we discover the structure of the world by observing it, not by being told 
the name of every object. 

Human vision is an active process that sequentially samples the optic 
array in an intelligent, task-specific way using a small, high-resolution 
fovea with a large, low-resolution surround. We expect much of the 
future progress in vision to come from systems that are trained end-to- 
end and combine ConvNets with RNNs that use reinforcement learning 
to decide where to look. Systems combining deep learning and rein- 
forcement learning are in their infancy, but they already outperform 
passive vision systems at classification tasks and produce impressive
results in learning to play many different video games.

Natural language understanding is another area in which deep learn- 
ing is poised to make a large impact over the next few years. We expect 
systems that use RNNs to understand sentences or whole documents 
will become much better when they learn strategies for selectively 
attending to one part at a time.

Ultimately, major progress in artificial intelligence will come about 
through systems that combine representation learning with complex 
reasoning. Although deep learning and simple reasoning have been 
used for speech and handwriting recognition for a long time, new
paradigms are needed to replace rule-based manipulation of symbolic
expressions by operations on large vectors.


Reinforcement learning improves
behaviour from evaluative feedback


Michael L. Littman


Reinforcement learning is a branch of machine learning concerned with using experience gained through interacting 
with the world and evaluative feedback to improve a system’s ability to make behavioural decisions. It has been called the 
artificial intelligence problem in a microcosm because learning algorithms must act autonomously to perform well and 
achieve their goals. Partly driven by the increasing availability of rich data, recent years have seen exciting advances in 
the theory and practice of reinforcement learning, including developments in fundamental technical areas such as gen- 
eralization, planning, exploration and empirical methodology, leading to increasing applicability to real-life problems. 


standing of decision making in humans and other animals in 

which learning is supervised through the use of reward signals 
in response to the observed outcomes of actions. As our understanding 
of this class of problems improves, so does our ability to bring it to bear 
in practical settings. Reinforcement learning is having a considerable 
impact on nuts-and-bolts questions such as how to create more effec- 
tive personalized Web experiences, as well as esoteric questions such 
as how to design better computerized players for the traditional board 
game Go or 1980s video games. Reinforcement learning is also provid- 
ing a valuable conceptual framework for work in psychology, cognitive 
science, behavioural economics and neuroscience that seeks to explain 
the process of decision making in the natural world. 

One way to think about machine learning is as a set of techniques 
to try to answer the question: when I am in situation x, what response 
should I choose? As a concrete example, consider the problem of assigning
patrons to tables in a restaurant. Parties of varying sizes arrive in
an unknown order and the host allocates each party to a table. The 
host maps the current situation x (size of the latest party and informa- 
tion about which tables are occupied and for how long) to a decision 
(which table to assign to the party), trying to satisfy a set of competing 
goals such as minimizing the waiting time for a table, balancing the load 
among the various servers, and ensuring that the members of a party 
can sit together. Similar allocation challenges come up in games such 
as Tetris and computational problems such as data-centre task alloca- 
tion. Viewed more broadly, this framework fits any problem in which a 
sequence of decisions needs to be made to maximize a scoring function 
over uncertain outcomes. 

The kinds of approaches needed to learn good behaviour for the 
patron-assignment problem depend on what kinds of feedback infor- 
mation are available to the decision maker while learning (Fig. 1). 

Exhaustive versus sampled feedback is concerned with the cover- 
age of the training examples. A learner given exhaustive feedback is 
exposed to all possible situations. Sampled feedback is weaker, in that 
the learner is only provided with experience of a subset of situations. 
The central problem in classic supervised learning is generalizing from 
sampled examples. 

Supervised versus evaluative feedback is concerned with how the 
learner is informed of right and wrong answers. A requirement for 
applying supervised learning methods is the availability of examples 
with known optimal decisions. In the patron-assignment problem, a 
host-in-training could work as an apprentice to a much more experienced
supervisor to learn how to handle a range of situations. If the
apprentice can only learn from supervised feedback, however, she would 
have no opportunities to improve after the apprenticeship ends. 

By contrast, evaluative feedback provides the learner with an assess- 
ment of the effectiveness of the decisions that she made; no information 
is available on the appropriateness of alternatives. For example, a host 
might learn about the ability of a server to handle unruly patrons by trial
and error: when the host makes an assignment of a difficult customer, 
it is possible to tell whether things went smoothly with the selected 
server, but no direct information is available as to whether one of the 
other servers might have been a better choice. The central problem in 
the field of reinforcement learning is addressing the challenge of evalu- 
ative feedback. 

One-shot versus sequential feedback is concerned with the rela- 
tive timing of learning signals. Evaluative feedback can be subdivided 
into whether it is provided directly for each decision or whether it has 
longer-term impacts that are evaluated over a sequence of decisions. 
For example, if a host needs to seat a party of 12 and there are no large 
tables available, some past decision to seat a small party at a big table 
might be to blame. Reinforcement learners must solve this temporal 
credit assignment problem to be able to derive good behaviour in the 
face of this weak sequential feedback. 

From the beginning, reinforcement-learning methods have con- 
tended with all three forms of weak feedback simultaneously — sam- 
pled, evaluative and sequential feedback. As such, the problem is 
considerably harder than that of supervised learning. However, methods 
that can learn from weak sources of feedback are more generally appli- 
cable and can be mapped to a variety of naturally occurring problems, 
as suggested by the patron-assignment problem. 

The following sections describe recent advances in several sub-areas 
of reinforcement learning that are expanding its power and applicability. 


Bandit problems 

The k-armed bandit problem concerns learning to make decisions from 
one-shot, exhaustive evaluative feedback. It is both harder (evaluative
versus supervised feedback) and easier (exhaustive versus sampled feed- 
back) than supervised learning. Initially, the problem was motivated by 
decision making in medical trials — prescribing a treatment to a test 
subject only reveals the outcome of that experiment and not the out- 
come of competing treatments. Decision making in this setting involves 


Figure 1 | Decisions of machine-learning feedback. Three kinds of feedback 
(sequential versus one-shot, sampled versus exhaustive and evaluative versus 
supervised) in machine learning and examples of learning problems (bandits, 
tabular reinforcement learning, reinforcement learning, contextual bandits 
and supervised machine learning) that result from their combination. 


an inherent exploration-exploitation trade-off as the learner is trying 
to perform well, but also needs to test out courses of action that could 
possibly be better than the current best guess. 

With increasing attention being paid to personalized medicine, bandit
algorithms could become popular again in the health-care setting.
However, the main driver of recent interest in these algorithms is the 
problem of online content delivery. A company providing web pages to a
large user base has many decisions it can make concerning the organiza- 
tion of the information it is providing. Should the page include a large 
image or a smaller image with space for a brief textual description? A 
simple formulation of this decision is as a two-armed bandit problem. 
Each time the system needs to present a page to the user, it chooses 
between layout A and layout B. It can then estimate the appropriateness 
of its decision by whether the user clicks on the presented link and, if 
so, how long the user spends reading and exploring the resulting pages. 
This information can then be used to inform future layout decisions. 

Good algorithms for this problem have been known for more than 
50 years. However, increased attention recently has resulted in new anal- 
yses and a deeper understanding of which algorithms are most effec- 
tive in which situations. One popular approach is the upper confidence 
bound (UCB) algorithm. The idea of the algorithm is quite simple: on
the basis of the number of times each alternative has been tested and 
the outcomes of these tests, the algorithm estimates the likelihood that 
a link will be selected (its ‘click-through rate’) for layout A and B sepa- 
rately. Statistical confidence bounds of the form ‘with 95% probability, 
A has a click-through rate of 5%, plus or minus 10%’ are computed. The 
upper confidence bound is the sum of the estimated click-through rate 
and the confidence bound; 15%, in this case. There are two reasons a 
condition might have a higher upper confidence bound: it has a higher 
estimated click-through rate (because it seems to be a good option — 
exploitation is warranted) or it has a wider confidence range (because 
it has not received as many tests — exploration is warranted). The UCB 
heuristic, then, is to always choose the option with the highest upper 
confidence bound, resulting in either high reward or valuable data for
learning. UCB is related to an earlier algorithm called interval estimation,
but it is defined so that it achieves excellent guarantees on its long-run
regret: roughly speaking, it minimizes the number of clicks lost owing
to showing the wrong version of the page.
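
The selection rule can be sketched for the two-layout example. This is a minimal illustration that assumes the common UCB1 form of the bound, in which the width of the confidence range is sqrt(2 ln t / n) for an alternative tested n times after t decisions in total; the simulated click-through rates and the function names are hypothetical.

```python
import math
import random

def ucb_select(counts, successes, t):
    """Pick the alternative with the highest upper confidence bound."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm  # test every alternative once before comparing bounds
    bounds = [successes[a] / counts[a] + math.sqrt(2 * math.log(t) / counts[a])
              for a in range(len(counts))]
    return max(range(len(counts)), key=lambda a: bounds[a])

def run_ucb(click_rates, rounds, seed=0):
    """Simulate page views; click_rates are the (unknown to the learner) true rates."""
    rng = random.Random(seed)
    counts = [0] * len(click_rates)
    successes = [0] * len(click_rates)
    for t in range(1, rounds + 1):
        arm = ucb_select(counts, successes, t)
        counts[arm] += 1
        if rng.random() < click_rates[arm]:
            successes[arm] += 1
    return counts

counts = run_ucb([0.05, 0.10], rounds=5000)  # layout A versus layout B
```

The two terms of the bound mirror the two reasons given above: the first rewards a high estimated click-through rate, and the second rewards alternatives that have received few tests.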

The UCB algorithm, as described, pays no attention to context. Its 
choices compare well with the best selection averaged over the entire 
population of users. Modern web pages have additional information 
that can be brought to bear, however. The situation at display time, x, 
includes facts that the system has about the user — recent pages visited, 
possibly demographic information and perhaps even a record of the 
user’s recent purchases. The system can use this information to create 
the most valuable, interesting or engaging page possible. For example, 
a news site might select a business or sports headline tailored to the 
current user’s preferences. We, thus, need a way to make good decisions 
in spite of both evaluative feedback and sampled feedback, a setting 
that combines aspects of k-armed bandits and supervised learning, now 
known as contextual bandits. UCB has been extended for use in this 
context. Indeed, contextual bandit algorithms are increasingly being
used by companies with a major Web presence. 

A modern approach to bandit problems with quite deep roots is a 
method now known as Thompson sampling. It bears some similarity
to a phenomenon known in psychology as probability matching; as a
bandit algorithm, the idea is to use the observations obtained thus far 
to maintain a posterior distribution over the click-through rate of the 
alternatives. When it is time to make a decision, the algorithm samples a 
click-through rate for each alternative from its corresponding posterior 
distribution, then selects the alternative that was assigned the highest 
value. Or, more simply, it chooses among the alternatives proportion- 
ally to the probability that each is best. Thompson sampling behaves 
somewhat like a noisier version of UCB, in that uncertain alternatives 
or confident-but-high-scoring alternatives are most likely to be selected. 
It can be used in similar scenarios as UCB and can also be shown to 
possess similar guarantees on its performance. One highly desirable
property, however, is that it provides a direct way of applying the emerging
class of non-parametric Bayesian methods to decision making: if
you can model a process probabilistically, you can use that model to 
make a selection. This makes it very well suited to decision making in 
contextual bandit problems. 
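
For click/no-click feedback, the procedure can be sketched with a Beta posterior per alternative. The uniform Beta(1, 1) prior, the simulated click-through rates and the function names are assumptions chosen for illustration.

```python
import random

def thompson_select(successes, failures, rng):
    """Sample a click-through rate per alternative from its Beta posterior
    and return the index of the alternative with the highest sample."""
    samples = [rng.betavariate(s + 1, f + 1) for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=lambda a: samples[a])

def run_thompson(click_rates, rounds, seed=0):
    """Simulate page views; click_rates are the (unknown to the learner) true rates."""
    rng = random.Random(seed)
    k = len(click_rates)
    successes, failures = [0] * k, [0] * k
    for _ in range(rounds):
        arm = thompson_select(successes, failures, rng)
        if rng.random() < click_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures

successes, failures = run_thompson([0.05, 0.10], rounds=5000)
```

An alternative with few observations has a wide posterior and so is occasionally assigned a high sampled value, which is how exploration arises without an explicit confidence bound.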


Temporal difference learning 

Learning from sequential feedback, also known as the temporal credit 
assignment problem, is an essential challenge faced by decision makers 
whose choices have both immediate and indirect effects. The essence 
of the idea can be illustrated through noughts and crosses. Imagine the 
computer is playing a nought against a strong opponent playing a cross. 
In a series of boards (Fig. 2), nought makes two moves (B and D) and 
ultimately loses. Which move should be blamed for the loss? A natural 
assumption is that both moves participated in the loss and therefore 
both are equally responsible, or that the move that was closer in time to 
the loss (D) has more responsibility. In this case, however, it was one of 
nought’s earlier moves (B) that set the stage for its defeat. 

The concept of temporal difference learning provides an algorithmic
way to make predictions about the long-term implications of selections. 
Over a series of games, the learner can estimate whether she is more
likely to win or lose from each of the boards she encounters. We can write
V(A) = 0 to represent the fact that nought is likely to tie from board A,
and V(B) = V(C) = V(D) = V(E) = −1 to represent the fact that nought
should be expected to lose from the other boards. The temporal difference
error between two consecutive boards x and x’ is V(x’) − V(x). Here,
the temporal difference error is 0 everywhere except for the transition
from A to B: it is the decision that changed nought’s situation from a
tie to a loss that should be held responsible for the loss. 
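
The calculation for the boards in this example is short enough to write out directly. The sketch below uses the values quoted above and assumes the boards occur in the order A, B, C, D, E.

```python
# Value estimates quoted in the text: a tie from A, a loss from every later board.
V = {"A": 0, "B": -1, "C": -1, "D": -1, "E": -1}
boards = ["A", "B", "C", "D", "E"]

# Temporal difference error V(x') - V(x) for each pair of consecutive boards.
td_errors = {(x, nxt): V[nxt] - V[x] for x, nxt in zip(boards, boards[1:])}

# Transitions whose error is non-zero are the ones held responsible.
blamed = [pair for pair, err in td_errors.items() if err != 0]
```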

Leveraging its utility as a marker of successful and unsuccessful deci- 
sions, the temporal difference error can be used as a learning signal for 
improving the value estimates. In particular, an ideal predictor of value 
will have the property that the long-term prediction for a situation x 
should be the same as the expected immediate outcome plus the prediction
for the resulting situation x’: V(x) ≈ E_{x’}[r(x’) + V(x’)]. Here, r(x’)
represents the feedback received from the environment for the transition
to x’. Classic temporal difference methods set about minimizing
the difference between these quantities. 
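
One common way of reducing this difference is to move each estimate a small step towards its target, V(x) ← V(x) + α [r(x’) + V(x’) − V(x)]. The sketch below applies that update to a hypothetical three-step chain in which only the final transition is rewarded; the chain, the step size α and the function name are assumptions made for illustration.

```python
def td0(episodes, alpha=0.5):
    """Tabular temporal difference learning on the chain s0 -> s1 -> s2 -> end."""
    chain = ["s0", "s1", "s2", "end"]
    V = {s: 0.0 for s in chain}  # the terminal state keeps value 0
    for _ in range(episodes):
        for x, nxt in zip(chain, chain[1:]):
            r = 1.0 if nxt == "end" else 0.0      # feedback for the transition to nxt
            V[x] += alpha * (r + V[nxt] - V[x])   # step towards r(x') + V(x')
    return V

values = td0(episodes=200)
```

The reward is first credited to s2 and then propagates backwards over later episodes, so every state in the chain eventually predicts the final outcome.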

Reinforcement learning in the face of evaluative, sampled and sequential
feedback requires combining methods for temporal credit assignment
with methods for generalization. Concretely, if the V values are
represented with a neural network or similar representation, finding
predictions such that V(x) ≈ E_{x’}[r(x’) + V(x’)] can lead to instability and
divergence in the learning process.

Recent work has modified the goal of learning slightly by incorporating
the generalization process into the learning objective
itself. Specifically, consider the goal of seeking V values such that
V(x) = Π E_{x’}[r(x’) + V(x’)], where Π is the projection of the values resulting
from their representation by the generalization method. When feedback
is exhaustive and no generalization is used, Π is the identity function
and this goal is no different from classic temporal difference learning.
However, when linear functions are used to represent values, this modified
goal can be used to create provably convergent learning algorithms.

This insight was rapidly generalized to non-linear function approximators
that are smooth (locally linear), to the control setting, and to
the use of eligibility traces in the learning process. This line of work
holds a great deal of promise for creating reliable methods for learning 
effective behaviour from very weak feedback. 


Planning 

A closely related problem to that of temporal credit assignment is 
planning. The essence of both of these problems is that when actions 
have both immediate and situation-altering effects, decisions need to 

Figure 2 | A series of boards in noughts and crosses leading to a loss for
nought. Against a strong opponent, cross (moves A, C and E), nought makes 
two moves (B and D) and ultimately loses. In this example, nought’s earlier 
move (B) set the stage for the defeat. 


be sensitive to both immediate and long-term outcomes. Temporal 
difference learning provides one mechanism for factoring long-term 
outcomes into the decision process. Planning is another in that it consid- 
ers the implications of taking entire sequences of actions and explicitly 
makes a trade-off among them. Planning has long been studied in artificial
intelligence, but research in reinforcement learning has brought
about a number of powerful innovations. 

A particularly challenging kind of planning that is relevant in the 
reinforcement-learning setting is planning under uncertainty. Here, 
outcomes of decisions are modelled as being drawn from a probabil- 
ity distribution over alternatives conditioned on the learner’s decision 
(Box 1). A familiar example of this kind of planning is move selection 
in board games. If a computer program is playing against a human, it
needs to make moves that are likely to lead to a win despite not know- 
ing precisely which moves its opponent will take. For a game such as 
noughts and crosses, a computer program can exhaustively enumerate 
all the reachable boards — the so-called game tree — and calculate 


BOX 1


Markov decision processes specify setting and tasks 


Reinforcement learning is characterized as an interaction between a 
learner and an environment that provides evaluative feedback. The 
environment is often conceptualized as a Markov decision process — a 
formal mathematical model that dates back to the 1950s (refs 55, 56). 
A Markov decision process is defined by a set of states S (situations 
in which a decision can be made), and actions A (the decisions or 
interventions the decision maker can select). These quantities can 
be taken to be finite, but continuous state and action spaces are 
often valuable for capturing interactions in important reinforcement- 
learning applications such as robotic control. A transition function
P(s'|s,a) defines the probability of the state changing from s to s’ under 
the influence of action a. It specifies the ‘physics’ or ‘dynamics’ of the 
environment. 

The decision maker’s task is defined by a reward function R(s,a) 
and discount factor γ ∈ [0, 1]. Rewards are delivered to the decision
maker with each transition and maximizing the cumulative discounted
expected reward is its objective. Specifically, the decision maker seeks
a behaviour π mapping states to actions generating a sequence of
rewards r_0, r_1, r_2, r_3, ... such that E_{r_0, r_1, ...}[r_0 + γ r_1 + γ^2 r_2 + γ^3 r_3 + ...] is as
large as possible. The relation between the environmental interaction
(state, action, reward, state, action, reward, ...) and the cumulative 
discounted expected reward is captured by the Bellman equation 
(Figure) for the optimal state-action value function Q*. The solution 
to the Bellman equation can be used to define optimal behaviour by 
π*(s) = argmax_a Q*(s, a). The cumulative discounted expected reward
for the policy that takes action a from state s and then behaves
optimally thereafter is the immediate reward received plus the 
discounted expected value of the cumulative discounted expected 
reward from the resulting state s’ given that the best action is chosen. 

Planning methods use knowledge of P and R to compute a good 
policy π. Reinforcement-learning methods have access to P and R only
through their ongoing interactions with the environment and must 
learn good behaviour. Note that the Markov decision process model 
captures both sequential feedback and the more specific one-shot 
feedback (when P(s’|s, a) is independent of s and a). It also captures 
both exhaustive feedback and the more general sampled feedback 
(when states are represented by features that allow R(s, a) to be 
represented compactly). 


Q*(s, a) = R(s, a) + γ Σ_{s’} P(s’|s, a) max_{a’} Q*(s’, a’)
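
When P and R are known, one standard way of solving the Bellman equation is to apply its right-hand side repeatedly until the values stop changing (value iteration). The sketch below does this for a hypothetical two-state, two-action Markov decision process; the states, rewards, discount factor and number of sweeps are all illustrative assumptions.

```python
def value_iteration(states, actions, P, R, gamma, sweeps=200):
    """Repeatedly apply Q(s,a) <- R(s,a) + gamma * sum_s' P(s'|s,a) max_a' Q(s',a')."""
    Q = {(s, a): 0.0 for s in states for a in actions}
    for _ in range(sweeps):
        Q = {(s, a): R[s, a] + gamma * sum(p * max(Q[s2, a2] for a2 in actions)
                                            for s2, p in P[s, a].items())
             for s in states for a in actions}
    return Q

# Hypothetical MDP: 'stay' keeps the state, 'go' switches it; only staying in b pays.
states = ["a", "b"]
actions = ["stay", "go"]
P = {("a", "stay"): {"a": 1.0}, ("a", "go"): {"b": 1.0},
     ("b", "stay"): {"b": 1.0}, ("b", "go"): {"a": 1.0}}
R = {("a", "stay"): 0.0, ("a", "go"): 0.0, ("b", "stay"): 1.0, ("b", "go"): 0.0}

Q = value_iteration(states, actions, P, R, gamma=0.5)
policy = {s: max(actions, key=lambda a: Q[s, a]) for s in states}  # argmax_a Q*(s, a)
```

In this example the resulting policy moves from a to b and then stays there, accepting no immediate reward in a in exchange for the discounted stream of rewards available from b.
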
[Figure 3 panel labels: Evaluation-function methods; Full tree; Monte Carlo methods]


Figure 3 | Schematic comparison of evaluation-function and Monte Carlo 
methods for planning. The top of the tree represents the current state for 
which a decision is needed. The red area represents the entire search tree,
which is too large to be examined. The green area represents the nodes of the tree
visited by evaluation-function methods — they expand to a fixed depth and 
then use an evaluation function to approximate the value received at the end 
of the game. The blue area represents the nodes of the tree visited by Monte 
Carlo methods — they consider a subset of nodes that ignore some actions 
(making it narrow) but continue to the leaves of the search tree (making
it deep). Some decision problems are much more amenable to this type of
approximation. 


whether its moves will lead to a win, a loss or a tie from each. In a game 
such as chess, however, the size of the game tree is astronomical and the 
best an algorithm can hope for is to assemble a representative sample of 
boards to consider. The classic approach for dealing with this problem 
is to use an evaluation function. Programs search as deeply in the game 
tree as they can, but if they cannot reach the end of the game where the 
outcome is known, a heuristic evaluation function is used to estimate 
how the game will end. Even a moderately good guess can lead to excellent 
game play in this context: evaluation-function methods have 
been used in chess from the earliest days of artificial intelligence to the 
decision making in the Deep Blue system that beat the World Chess 
Champion Garry Kasparov. 

A powerful variation of this idea is to use machine learning to 
improve the evaluation function over time. Here, temporal difference 
learning can be used in simulated games to provide opportunities to 
learn an accurate evaluation function represented as, for example, a 
neural network. Celebrated examples of the use of temporal difference 
methods combined with generalization were in the domains of checkers 
and backgammon. A similar approach was responsible for Daily 
Double wagering in the Watson system for playing the US television 
game show Jeopardy!. 

In spite of their impressive, indeed superhuman, performance in 
some games, evaluation-function methods have not been universally 
successful. A notable example is the game of Go, in which good evalua- 
tion functions have been hard to come by, resulting in subpar game play. 
A relatively recent breakthrough was the introduction of Monte Carlo 
tree search (MCTS) methods that swap an estimate of the evaluation 
function for an estimate of a good policy. Whereas evaluation-function 
methods try all possible actions out to a fixed depth and estimate the 
value of the resulting boards, Monte Carlo methods search all the way to 
the end of the game where evaluations are exact, but use a heuristic rule 
to suggest which subset of actions is worth considering (Fig. 3). Use of 
the upper confidence intervals in trees (UCT) algorithm resulted in a 
quantum leap in performance in Go. It leverages the UCB algorithm 
to intelligently allocate search resources at each point in the tree. 
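
As a rough illustration of how an upper-confidence rule allocates search effort among the actions at one tree node, the sketch below scores each action by its average return plus an exploration bonus that shrinks as the action is tried more often. It assumes the common UCB1 form of the bonus; the function name, argument names and the default exploration constant are illustrative choices, not details taken from this Review.

```python
import math

def ucb_select(counts, value_sums, c=math.sqrt(2)):
    """Return the index of the action with the highest upper confidence bound.

    counts[i]     -- number of times action i has been tried at this node
    value_sums[i] -- sum of the returns observed after trying action i
    c             -- exploration constant (an assumed default)
    """
    # Try every action once before comparing confidence bounds.
    for i, n in enumerate(counts):
        if n == 0:
            return i
    total = sum(counts)

    def bound(i):
        mean = value_sums[i] / counts[i]
        bonus = c * math.sqrt(math.log(total) / counts[i])
        return mean + bonus

    return max(range(len(counts)), key=bound)
```

A rarely tried action keeps a large bonus, so it is revisited even when its current average is lower; once both actions are well sampled, the one with the better average wins.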


Model-based reinforcement learning 

To plan, a learning algorithm needs to be able to predict the imme- 
diate outcomes of its actions. Since it can try an action and observe 
the effects right away, this part of the problem can be addressed using 
supervised learning methods. In model-based reinforcement-learning 
approaches, experiences are used to estimate a transition model, map- 
ping from situations and decisions to resulting situations, and then a 
planning approach is used to make decisions that maximize predicted 
long-term outcomes. 

Model-based reinforcement-learning methods and model-free methods 
such as temporal differences strike different balances between computational 
expense and experience efficiency. To a first approximation, 
model-free methods are inexpensive but slow to learn, whereas model-based 
methods are computationally intensive but squeeze the most out 
of each experience (Box 2). In applications such as learning to manage 
memory resources in a central processing unit, decisions need to be 
made at the speed of a computer’s clock and data are abundant, making 
temporal difference learning an excellent fit. In applications such as a 
robot helicopter learning acrobatic tricks, experience is slow to collect 
and there is ample time offline to compute decision policies, making a 
model-based approach appropriate. There are also methods that draw 
on both styles of learning to try to get the complementary advantages. 

Model-based algorithms can make use of insights from supervised 
learning to deal with sampled feedback when learning their transition 


BOX 2 


Two approaches to tabular reinforcement learning 


Tabular reinforcement-learning problems are those in which the 
state-action space can be explored exhaustively. Although challenging, 
there are a number of relatively simple algorithms that can be used 
effectively in this setting. Q-learning is a model-free algorithm that 
uses transition experience of the form <s, a, r, s’> (from state s, action a 
resulted in reward r and next state s’) to improve an estimate Q of the 
optimal state-action value function Q*. In particular, since 
r + γ max_{a’} Q*(s’, a’) is an unbiased estimator of Q*(s, a) as defined by the Bellman 
equation and Q is an estimate of Q*, the estimate can be updated as: 

$$Q(s, a) \leftarrow Q(s, a) + \alpha \left( r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right),$$

where α is a learning-rate parameter that controls how new 
estimates are iteratively blended together over time. Indeed, 
if the learning rate is decayed at an appropriate rate, 

$$Q(s, a) \to Q^*(s, a)$$

for all state-action pairs. 
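
The Q-learning update can be written in a few lines. The following is a minimal sketch, assuming a dictionary-backed table and fixed values for the learning rate and discount factor; the function name and defaults are illustrative, and terminal states and learning-rate decay are deliberately left out.

```python
from collections import defaultdict

def q_learning_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.9):
    """Apply one Q-learning update for the experience <s, a, r, s_next>.

    Q       -- mapping from (state, action) to the current estimate
    actions -- the actions available in s_next
    alpha   -- learning rate; gamma -- discount factor (assumed defaults)
    """
    # Value of the best action in the next state under the current estimate.
    best_next = max(Q[(s_next, a2)] for a2 in actions)
    # Move the estimate a fraction alpha towards the sampled target.
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
    return Q[(s, a)]

# Usage: a table that starts at zero for every state-action pair.
Q = defaultdict(float)
q_learning_update(Q, s=0, a="right", r=1.0, s_next=1, actions=["left", "right"])
```

Because each update touches a single table entry, the cost per experience is small, which is the sense in which model-free methods are computationally inexpensive.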

A simple model-based approach to tabular reinforcement learning 
uses the observed transition experience to estimate R(s, a) (by 
averaging all rewards received from state s and action a) and P(s’|s, a) 
(by the fraction of transitions from state s and action a that transition to 
state s’). Solving the Bellman equation — through planning algorithms 
such as value iteration, policy iteration, UCT (upper confidence 
intervals in trees) or linear programming methods — using these 
estimates leads to an approximation Q of the state-action value 
function that converges to Q* in the limit of infinite data. 
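
The model-based recipe described above can be sketched as two steps: estimate R and P by counting, then solve the Bellman equation on the estimates. The sketch below uses value iteration with a fixed number of sweeps; the function names, the sweep count and the treatment of unvisited state-action pairs (left at zero) are assumptions made for illustration. It also assumes every observed next state appears in `states`.

```python
from collections import defaultdict

def estimate_model(experience):
    """Estimate R(s, a) and P(s' | s, a) from <s, a, r, s'> tuples by counting."""
    reward_sum = defaultdict(float)
    visits = defaultdict(int)
    next_counts = defaultdict(lambda: defaultdict(int))
    for s, a, r, s_next in experience:
        reward_sum[(s, a)] += r
        visits[(s, a)] += 1
        next_counts[(s, a)][s_next] += 1
    # Average reward and empirical transition frequencies per state-action pair.
    R = {sa: reward_sum[sa] / n for sa, n in visits.items()}
    P = {sa: {s2: c / visits[sa] for s2, c in nxt.items()}
         for sa, nxt in next_counts.items()}
    return R, P

def value_iteration(R, P, states, actions, gamma=0.9, sweeps=200):
    """Approximate Q* for the estimated model by repeated Bellman backups."""
    Q = {(s, a): 0.0 for s in states for a in actions}
    for _ in range(sweeps):
        V = {s: max(Q[(s, a)] for a in actions) for s in states}
        for s in states:
            for a in actions:
                if (s, a) in R:  # unvisited pairs keep their initial value of 0
                    Q[(s, a)] = R[(s, a)] + gamma * sum(
                        p * V[s2] for s2, p in P[(s, a)].items())
    return Q
```

Every sweep revisits the whole table, so each new experience is used many times over; this is the extra computation that model-based methods trade for better use of data.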
BOX 3 


The KWIK (‘knows what it knows’) learning setting provides a 
powerful set of algorithms for rapidly learning accurate predictions 
and confidently distinguishing learned from not-yet-learned instances. 
In the context of reinforcement-learning algorithms, this ability to 
tag predictions as uncertain can be exploited to drive exploration by 
substituting optimistic estimates for those that remain unknown. 

A growing set of KWIK-learnable function classes has been 
identified. As a simple example, consider trying to learn a function of 
the form f(x) = ‘is x divisible by k?’ for some unknown natural number 
k. The Table shows what a KWIK learning algorithm for this problem 
would predict given training examples f(6) = yes and f(20) = no. It 
would know f(6) = yes and f(20) = no from these examples. It would be 
able to reason that the unknown k ∈ {3, 6} from these examples (k must 
divide 6 but not 20) and therefore that f(25) = no. By contrast, 
f(15) = ‘I don’t know’ because it 
would be ‘yes’ if k = 3 and ‘no’ if k = 6. Despite this lingering ambiguity, 
it could conclude that f(24) = yes, since that would be the answer for 
any of the remaining possible values of k. This last element lies at the 
heart of all KWIK-learning algorithms: not ‘wasting’ their ‘I don’t know’ 
answers on examples whose outputs can be inferred from the data. 


Self-aware learning in reinforcement learning 


For noisy function classes, a KWIK learner must be able to predict 
the probability of various outputs given an input. This property is 
important when learning Markov decision process transition functions. 
Some challenging function classes for which KWIK learning algorithms 
are known include noisy linear regression, and noisy union. Noisy 
union, also called the ‘adaptive k meteorologist’ problem, lets the 
learner choose a function from a finite class when there is a non-zero 
probability that training examples are corrupted. 


Table 1 | A training set (f(6) = yes, f(20) = no) for a function of the 
form ‘f(x) = is x divisible by k?’ for some unknown natural number 
k, and the predictions made by a valid KWIK algorithm. 


Input x    Output f(x)    Prediction for f(x) 
6          Yes            Yes 
15         -              I don’t know 
20         No             No 
24         -              Yes 
25         -              No 
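
The divisibility example in this box can be reproduced with a small hypothesis-elimination learner. The sketch below keeps every candidate k that agrees with the training examples and answers only when all surviving candidates agree. The finite candidate range is an assumption added so that the hypothesis space can be enumerated, and the function names are illustrative.

```python
def kwik_divisibility(examples, candidates=range(1, 101)):
    """Return the consistent values of k and a KWIK-style predictor.

    examples   -- list of (x, label) pairs, with label True for 'yes'
    candidates -- assumed finite hypothesis space for the unknown k
    """
    # Keep only the hypotheses that reproduce every training label.
    consistent = [k for k in candidates
                  if all((x % k == 0) == label for x, label in examples)]

    def predict(x):
        answers = {x % k == 0 for k in consistent}
        if answers == {True}:
            return "yes"
        if answers == {False}:
            return "no"
        return "I don't know"  # the remaining hypotheses disagree

    return consistent, predict

consistent, predict = kwik_divisibility([(6, True), (20, False)])
```

With the two training examples from the table, `consistent` is `[3, 6]`, and `predict` returns `"I don't know"` for 15, `"yes"` for 24 and `"no"` for 25, matching the table.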


models and can tackle the problem of sequential feedback by planning 
actions to achieve their goals. The issue of evaluative feedback (the 
fact that the system can only learn about transitions it has actually 
experienced) makes it very important that learners balance exploration 
and exploitation. Several methods are known for achieving guaranteed 
efficiency in model-based reinforcement learning by explicitly modelling 
the certainty of predictions and, much like UCB, taking an optimistic 
view of the uncertainty. Concretely representing the uncertainty 
in learning leads to the framework ‘knows what it knows’ (Box 3), 
which has been shown to work well with planning algorithms in model-based 
reinforcement-learning settings. 


Empirical methods 

An important turning point in the development of practical supervised 
learning methods was a shift from artificial learning problems 
to learning problems grounded in measured data and careful experimental 
design. The reinforcement-learning community is showing 
signs of a similar evolution, which promises to help to solidify the 
technical gains described in the previous sections. Of particular inter- 
est are new evaluation methodologies that go beyond case studies of 
individual algorithm learning in individual environments to con- 
trolled studies of different learning algorithms and their performance 
across multiple natural domains. In supervised learning, data-set col- 
lections such as the University of California, Irvine Machine Learning 
Repository (http://archive.ics.uci.edu/ml/) and the Weka Machine 
Learning library (http://www.cs.waikato.ac.nz/ml/weka/) facilitate 
this kind of research by providing data for learning problems and a 
suite of algorithms in a modular form. A challenge for the study of 
reinforcement learning, however, is that learning takes place through 
active exploration between the learner and her environment. Envi- 
ronments are almost always represented by simulators, which can be 
difficult to standardize and share, and are rarely realistic stand-ins 
for the actual environment of interest. 

One noteworthy exception is the Arcade Learning Environment, 
which provides a reinforcement-learning-friendly interface for an 
eclectic collection of more than 50 Atari 2600 video games. Although 
video games are by no means the only domain of interest, the diversity 
of game types and the unity of the interface make the Arcade 
Learning Environment an exciting platform for the empirical study of 
machine reinforcement learning. An extremely impressive example of 
research enabled by this repository is a recent study of deep learning 
in the reinforcement-learning setting. Researchers combined a set 
of existing and novel algorithm elements to create an agent capable 
of ‘human-level’ performance across the set of games. It remains to 
be seen whether the algorithmic approaches highlighted in this work 
become standard weapons in the reinforcement-learning arsenal, but 
the remarkable success on these challenging learning problems has 
attracted a great deal of attention and is likely to generate advances 
in the theory and practice of reinforcement learning. 


A perspective on future impacts 
There are several current directions that promise considerable impacts 
in the years to come. 


Offline evaluation 

One challenge to designing practical, reliable learning systems for real- 
life reinforcement-learning problems is that the learning problem is 
carried out online. That is, in contrast to supervised learning systems 
that work with a training set that contains measured, but frozen, data, 
reinforcement learning is defined with regard to a continual interaction 
between the learner and an environment. 

To overcome this difficulty, several efforts are underway to col- 
lect static data sets in a way that allows them to be used for evaluating 
reinforcement-learning algorithms. In the biostatistics community, a 
new type of clinical trial called a sequential, multiple assignment randomized 
trial (SMART) has been developed that makes it possible to 
gather treatment responses in a way that supports the synthesis of adap- 
tive treatment plans. That is, the trial assesses the effectiveness of treat- 
ments expressed as flow charts in which an intervention is applied and, 
depending on the patient’s response, the plan follows up with different 
possible interventions. A typical SMART can evaluate a flow chart con- 
taining eight different possible treatment plans using half the participants 
that would otherwise be needed. 

SMARTs evaluate flow-chart-like treatment plans, but not complete 
learning algorithms. Learning algorithms can express extremely complex 
contingent relationships between the data acquired during learning 
and the decisions made in response to it. Consider a simple two-armed 
bandit problem over a set of ten steps. The tenth decision in the sequence 
is conditioned on the outcomes of all the previous decisions. Thus, for 
each of the 2^9 = 512 possible contexts, our bandit algorithm needs to make 
a good choice. Longer learning periods and contextual decisions make 
the number of possibilities to consider astronomically high. Despite 
this daunting combinatorial explosion, offline evaluation procedures 
for contextual bandit problems have been proposed. The key idea is to 
collect a static (but enormous) set of contextual bandit decisions by running 
a uniform random selection algorithm on a real problem (a news 
article recommendation, for example). Then, to test a bandit algorithm 
using this collection, present the algorithm with the same series of contexts. 
Whenever the selection made by the algorithm does not match 
the selection made during data collection, the algorithm is forced to 
forget what it saw. Thus, as far as the algorithm ‘knows’, all of its choices 
during learning matched the choices made during data collection. The 
end result is an unbiased evaluation of a dynamic bandit algorithm using 
static pre-recorded data. 
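
The replay procedure just described can be sketched as a loop over the logged events that discards every round in which the algorithm under test disagrees with the logged choice. This is a minimal sketch under stated assumptions: the `choose`/`update` interface, the event layout and the `FixedArmPolicy` stand-in are hypothetical names introduced for illustration, and the logged arms are assumed to have been chosen uniformly at random.

```python
class FixedArmPolicy:
    """Hypothetical stand-in for a bandit algorithm: always picks one arm."""

    def __init__(self, arm):
        self.arm = arm
        self.seen = []  # the only events the policy is ever shown

    def choose(self, context):
        return self.arm

    def update(self, context, arm, reward):
        self.seen.append((context, arm, reward))


def replay_evaluate(policy, logged_events):
    """Estimate a bandit algorithm's average reward from uniformly logged data.

    logged_events -- iterable of (context, logged_arm, reward) tuples
    Returns (average reward over matched rounds, number of matched rounds).
    """
    total_reward, matched = 0.0, 0
    for context, logged_arm, reward in logged_events:
        if policy.choose(context) != logged_arm:
            continue  # mismatch: the round is skipped and never shown to the learner
        policy.update(context, logged_arm, reward)
        total_reward += reward
        matched += 1
    return (total_reward / matched if matched else 0.0), matched
```

With two arms, roughly half of the logged rounds are discarded, which is why the static collection needs to be so much larger than the learning run it stands in for.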

One reason that this approach works for contextual bandit problems 
is that — apart from the state of the learning algorithm — there are no 
dependencies from one round of decision making to the next. So, skip- 
ping many rounds because the algorithm and the data collection do not 
match does not change the fundamental character of the learning prob- 
lem. In the presence of sequential feedback, however, this trick no longer 
applies. Proposals have been made for creating reinforcement-learning 
data repositories and for evaluating policies using observational data 
sets, but no solid solution for evaluating general learning algorithms 
has yet emerged. 


Finding appropriate reward functions 

Reinforcement-learning systems strive to identify behaviour that maxi- 
mizes expected total reward. There is a sense, therefore, that they are 
‘programmable’ through their reward functions — the mappings from 
situation to score. A helpful analogy to consider is between learning 
and program interpretation in digital computers: a program (reward 
function) directs an interpreter (learning algorithm) to process inputs 
(environments) into desirable outputs (behaviour). The vast majority 
of research in reinforcement learning has been into the development 
of effective interpreters with little attention paid to how we should be 
programming them (providing reward functions). 

There are a few approaches that have been explored for producing 
reward functions that induce a target behaviour given some conception 
of that behaviour. As a simple example, if one reward function for the 
target behaviour is known, the space of behaviour-preserving transformations 
to this reward function is well understood; other, essentially 
equivalent, reward functions can be created. If a human teacher is available 
who can provide evaluative feedback, modifications to intended 
target reinforcement-learning algorithms can be used to search for the 
target behaviour. The problem of inverse reinforcement learning 
addresses the challenge of creating appropriate reward functions given 
the availability of behavioural traces from an expert executing the target 
behaviour. It is called ‘inverse’ reinforcement learning because the learner 
is given behaviour and needs to generate a reward function instead of 
the other way around. These methods infer a reward function that the 
demonstrated behaviour optimizes. One can turn to evolutionary optimization 
to generate reward functions if there is an effective way to 
evaluate the appropriateness of the resulting behaviour. One advantage of 
the evolutionary approach is that, among the many possible reward functions 
that generate good behaviour, it can identify ones that provide helpful 
but not too distracting hints that can speed up the learning process. 

Looking forward, new techniques for specifying complex behaviour and 
translations of these specifications into appropriate reward functions are 
essential. Existing reward-function specifications lack the fundamental 
ideas that enable the design of today’s massive software systems, such as 
abstraction, modularity and encapsulation. Analogues of these ideas could 
greatly extend the practical utility of reinforcement-learning systems. 


Reinforcement learning as a cognitive model 
The term reinforcement learning itself originated in the animal- 
learning community. It had long been observed that certain stimuli, 


450 | NATURE | VOL 521 | 28 MAY 2015 


if present after an animal selects a particular behavioural action, cause 
that behavioural action to become more frequent in the future. These 
stimuli became known as reinforcers because of their role in strengthening 
behaviour. The pioneers of artificial intelligence were aware of 
this idea and used the term to describe computational processes 
that would similarly increase the intensity or likelihood of a response. 
Indeed, according to the software tool Google Ngrams (which charts 
the frequency of phrases in its database of published books; 
https://books.google.com/ngrams), the appearance of ‘reinforcement learning’ 
reached a local maximum in the mid-1960s, as it was in use by researchers 
studying the theory of animal learning, education and cybernetics. 

The term lost ground through the 1970s and early 1980s, but was 
picked up again by computer scientists in the mid-to-late 1980s 
and became a central area of study in machine learning. A subtle but 
important difference was introduced during this second wave. The term 
‘reinforcement’ came to refer to a generic reward signal that needs to 
be predicted over time instead of the activity of strengthening a behavioural 
response. Classic learning problems could still be discussed in this 
way, but the door opened to a wide variety of other powerful algorithmic 
approaches, such as temporal difference learning, for which behavioural 
strengthening was not a direct design goal. 

As already discussed, the predictive view proved to be powerful and 
productive. Recent work in neuroscience and cognitive psychology has 
leveraged this revised understanding of reinforcement learning to create 
new insights into biological learning. Of particular note is the discovery 
of a temporal difference signal encoded in the firing of dopamine neurons, 
which has created a bridge between computer scientists and cognitive 
neuroscientists centred on reinforcement-learning algorithms. 
In addition to temporal difference methods, cognitive scientists have 
found the distinction between model-based (goal-directed) and model-free 
(habitual) learning to be a fertile one. For example, some aspects 
of how people make moral judgments can be fruitfully described by 
associating action-based preferences (do not do A because it is wrong) 
with model-free learning and outcome-based preferences (do not do 
A because it will lead to a bad situation) with model-based learning. 

In October 2013, the first Multi-disciplinary Conference on Rein- 
forcement Learning and Decision Making was held at Princeton Uni- 
versity in New Jersey. In addition to computer scientists, psychologists 
and neuroscientists, the invited speakers included engineers, roboticists, 
a zoologist, a psychiatrist and a behavioural game theorist. The talks 
were notable both for the breadth of topics and the convergence of the 
underlying conceptual framework discussed in this Review. As our for- 
mal and practical understanding of learning from evaluative feedback 
expands, so does the community of scientists and engineers who can 
contribute to and benefit from these ideas. 
Probabilistic machine learning 
and artificial intelligence 


Zoubin Ghahramani 


How can a machine learn from experience? Probabilistic modelling provides a framework for understanding what learning 
is, and has therefore emerged as one of the principal theoretical and practical approaches for designing machines 
that learn from data acquired through experience. The probabilistic framework, which describes how to represent and 
manipulate uncertainty about models and predictions, has a central role in scientific data analysis, machine learning, 
robotics, cognitive science and artificial intelligence. This Review provides an introduction to this framework, and discusses 
some of the state-of-the-art advances in the field, namely, probabilistic programming, Bayesian optimization, 
data compression and automatic model discovery. 


The key idea behind the probabilistic framework to machine learning 
is that learning can be thought of as inferring plausible models 
to explain observed data. A machine can use such models to make 
predictions about future data, and take decisions that are rational given 
these predictions. Uncertainty plays a fundamental part in all of this. 
Observed data can be consistent with many models, and therefore which 
model is appropriate, given the data, is uncertain. Similarly, predictions 
about future data and the future consequences of actions are uncertain. 
Probability theory provides a framework for modelling uncertainty. 

This Review starts with an introduction to the probabilistic approach 
to machine learning and Bayesian inference, and then discusses some of 
the state-of-the-art advances in the field. Many aspects of learning and 
intelligence crucially depend on the careful probabilistic representation of 
uncertainty. Probabilistic approaches have only recently become a mainstream 
approach to artificial intelligence, robotics and machine learning. 
Even now, there is controversy in these fields about how important 
it is to fully represent uncertainty. For example, advances using deep neural 
networks to solve challenging pattern-recognition problems such as speech 
recognition, image classification, and prediction of words in text, do not 
overtly represent the uncertainty in the structure or parameters of those 
neural networks. However, my focus will not be on these types of pattern-recognition 
problems, characterized by the availability of large amounts 
of data, but on problems for which uncertainty is really a key ingredient, 
for example where a decision may depend on the amount of uncertainty. 

I highlight five areas of current research at the frontier of probabilistic 
machine learning, emphasizing areas that are of broad relevance to scientists 
across many fields: probabilistic programming, which is a general 
framework for expressing probabilistic models as computer programs 
and which could have a major impact on scientific modelling; Bayes- 
ian optimization, which is an approach to globally optimizing unknown 
functions; probabilistic data compression; automating the discovery of 
plausible and interpretable models from data; and hierarchical modelling 
for learning many related models, for example for personalized medicine 
or recommendation. Although considerable challenges remain, the com- 
ing decade promises substantial advances in artificial intelligence and 
machine learning based on the probabilistic framework. 


Probabilistic modelling and representing uncertainty 
At the most basic level, machine learning seeks to develop methods for 
computers to improve their performance at certain tasks on the basis of 
observed data. Typical examples of such tasks might include detecting 
pedestrians in images taken from an autonomous vehicle, classifying 
gene-expression patterns from leukaemia patients into subtypes by clin- 
ical outcome, or translating English sentences into French. However, as 
I discuss, the scope of machine-learning tasks is even broader than these 
pattern classification or mapping tasks, and can include optimization 
and decision making, compressing data and automatically extracting 
interpretable models from data. 

Data are the key ingredients of all machine-learning systems. But 
data, even so-called big data, are useless on their own until one extracts 
knowledge or inferences from them. Almost all machine-learning 
tasks can be formulated as making inferences about missing or latent 
data from the observed data — I will variously use the terms inference, 
prediction or forecasting to refer to this general task. Elaborating the 
example mentioned, consider classifying people with leukaemia into 
one of the four main subtypes of this disease on the basis of each person’s 
measured gene-expression patterns. Here, the observed data are pairs of 
gene-expression patterns and labelled subtypes, and the unobserved or 
missing data to be inferred are the subtypes for new patients. To make 
inferences about unobserved data from the observed data, the learning 
system needs to make some assumptions; taken together these assump- 
tions constitute a model. A model can be very simple and rigid, such as a 
classic statistical linear regression model, or complex and flexible, such 
as a large and deep neural network, or even a model with infinitely many 
parameters. I return to this point in the next section. A model is con- 
sidered to be well defined if it can make forecasts or predictions about 
unobserved data having been trained on observed data (otherwise, if 
the model cannot make predictions it cannot be falsified, in the sense 
of the philosopher Karl Popper’s proposal for evaluating hypotheses, or 
as the theoretical physicist Wolfgang Pauli said the model is “not even 
wrong”). For example, in the classification setting, a well-defined model 
should be able to provide predictions of class labels for new patients. 
Since any sensible model will be uncertain when predicting unobserved 
data, uncertainty plays a fundamental part in modelling. 

There are many forms of uncertainty in modelling. At the lowest 
level, model uncertainty is introduced from measurement noise, for 
example, pixel noise or blur in images. At higher levels, a model may 
have many parameters, such as the coefficients of a linear regression, 
and there is uncertainty about which values of these parameters will 
be good at predicting new data. Finally, at the highest levels, there is 
often uncertainty about even the general structure of the model: is linear 
regression or a neural network appropriate; if the latter, how many layers 
should it have; and so on. 

The probabilistic approach to modelling uses probability theory to 
express all forms of uncertainty. Probability theory is the mathematical 
language for representing and manipulating uncertainty, in much the 
same way as calculus is the language for representing and manipulating 
rates of change. Fortunately, the probabilistic approach to modelling is 
conceptually very simple: probability distributions are used to represent 
all the uncertain unobserved quantities in a model (including structural, 
parametric and noise-related) and how they relate to the data. Then the 
basic rules of probability theory are used to infer the unobserved quan- 
tities given the observed data. Learning from data occurs through the 
transformation of the prior probability distributions (defined before 
observing the data), into posterior distributions (after observing data). 
The application of probability theory to learning from data is called 
Bayesian learning (Box 1). 

Apart from its conceptual simplicity, there are several appealing prop- 
erties of the probabilistic framework for machine intelligence. Simple 
probability distributions over single or a few variables can be com- 
posed to form the building blocks of larger, more complex models. The 
dominant paradigm in machine learning over the past two decades for 
representing such compositional probabilistic models has been graphical 
models, with variants including directed graphs (also known as 
Bayesian networks and belief networks), undirected graphs (also known 
as Markov networks and random fields), and mixed graphs with both 
directed and undirected edges (Fig. 1). As discussed later, probabilistic 
programming offers an elegant way of generalizing graphical models, 
allowing a much richer representation of models. The compositionality 
of probabilistic models means that the behaviour of these building blocks 
in the context of the larger model is often much easier to understand 
than, say, what will happen if one couples a non-linear dynamical system 
(for example, a recurrent neural network) to another. In particular, for 
a well-defined probabilistic model, it is always possible to generate data 
from the model; such ‘imaginary’ data provide a window into the ‘mind’ 
of the probabilistic model, helping us to understand both the initial prior 
assumptions and what the model has learned at any later stage. 

Probabilistic modelling also has some conceptual advantages over 
alternatives because it is a normative theory for learning in artificially 
intelligent systems. How should an artificially intelligent system represent 
and update its beliefs about the world in light of data? The Cox axioms 
define some desiderata for representing beliefs; a consequence of these 
axioms is that ‘degrees of belief’, ranging from ‘impossible’ to ‘absolutely 
certain’, must follow all the rules of probability theory. This justifies 
the use of subjective Bayesian probabilistic representations in artificial 
intelligence. An argument for Bayesian representations in artificial intelligence 
that is motivated by decision theory is given by the Dutch book 
theorem. The argument rests on the idea that the strength of beliefs of an 
agent can be assessed by asking the agent whether it would be willing to 
accept bets at various odds (ratios of payoffs). The Dutch book theorem 
states that unless an artificial intelligence system’s (or human’s, for that 
matter) degrees of belief are consistent with the rules of probability, it 
will be willing to accept bets that are guaranteed to lose money. Because 
of the force of these and many other arguments on the importance of a 
principled handling of uncertainty for intelligence, Bayesian probabilistic 
modelling has emerged not only as the theoretical foundation for rationality 
in artificial intelligence systems, but also as a model for normative 
behaviour in humans and animals (but see refs 19, 20 for a discussion), 
and much research is devoted to understanding how neural circuitry may 
be implementing Bayesian inference. 
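
A small piece of arithmetic makes the Dutch book argument concrete. Suppose an agent states degrees of belief for an event A and for its complement, and is willing to buy or sell a ticket paying one unit on each event at a price equal to its stated belief. The sketch below, which uses illustrative names and covers only this simplest two-event case, computes the agent's payoff when a bookmaker picks the side of each trade.

```python
def dutch_book_payoff(belief_a, belief_not_a, stake=1.0):
    """Guaranteed payoff to an agent trading tickets at its own stated beliefs.

    A ticket on an event pays `stake` if the event occurs, and the agent
    prices it at belief * stake. Exactly one of A and not-A occurs.
    """
    total = belief_a + belief_not_a
    if total >= 1.0:
        # Bookmaker sells both tickets: agent pays total * stake, receives stake.
        return stake - total * stake
    # Bookmaker buys both tickets: agent receives total * stake, pays out stake.
    return total * stake - stake
```

For beliefs of 0.6 and 0.6 the agent loses 0.2 units whichever event occurs, and the same holds for 0.4 and 0.4; only beliefs that sum to one, as probability theory requires, leave no guaranteed loss.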

Although conceptually simple, a fully probabilistic approach to 
machine learning poses a number of computational and modelling chal- 
lenges. Computationally, the main challenge is that learning involves mar- 
ginalizing (summing out) all the variables in the model except for the 
variables of interest (Box 1). Such high-dimensional sums and integrals 
are generally computationally hard, in the sense that for many models 
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BOX 1 
Bayesian machine learning 


There are two simple rules that underlie probability theory: the sum
rule:

$$P(x) = \sum_{y \in Y} P(x, y)$$

and the product rule:

$$P(x, y) = P(x)\,P(y \mid x).$$


Here x and y correspond to observed or uncertain quantities, taking 
values in some sets X and Y, respectively. For example, x and y might 
relate to the weather in Cambridge and London, respectively, both 
taking values in the set X= Y={rainy,cloudy,sunny}. P(x) corresponds 
to the probability of x, which can be either a statement about the 
frequency of observing a particular value, or a subjective belief about 
it. P(x, y) is the joint probability of observing x and y, and P(y|x) is the
probability of y conditioned on observing the value of x. The sum rule 
states that the marginal of x is obtained by summing (or integrating 
for continuous variables) the joint over y. The product rule states that 
the joint can be decomposed as the product of the marginal and the 
conditional. Bayes rule is a corollary of these two rules: 


$$P(y \mid x) = \frac{P(x \mid y)\,P(y)}{P(x)} = \frac{P(x \mid y)\,P(y)}{\sum_{y \in Y} P(x, y)}$$


We can apply probability theory to machine learning by replacing
the symbols above: we replace x by D to denote the observed data,
we replace y by θ to denote the unknown parameters of a model, and
we condition all terms on m, the class of probabilistic models we are
considering. For learning, we thus get:

$$P(\theta \mid D, m) = \frac{P(D \mid \theta, m)\,P(\theta \mid m)}{P(D \mid m)}$$

where P(D|θ,m) is the likelihood of parameters θ in model m,
P(θ|m) is the prior probability of θ and P(θ|D,m) is the posterior of θ
given data D.

For example, the data D might be a time series of hourly 
observations of the weather in Cambridge and London, and the 
model might attempt to capture the joint weather patterns at both 
locations over successive hours, with parameters θ modelling
correlations over time and space. Learning is the transformation
of prior knowledge or assumptions about the parameters P(θ|m),
through the data D, into posterior knowledge about the parameters,
P(θ|D,m). This posterior is now the prior to be used for future data.
A learned model can be used to predict or forecast new unseen test
data, D_test, by simply applying the sum and product rule to get the
prediction:

$$P(D_{\text{test}} \mid D, m) = \int P(D_{\text{test}} \mid \theta, D, m)\,P(\theta \mid D, m)\,d\theta$$


Finally, different models can be compared by applying Bayes rule 
at the level of m: 


$$P(m \mid D) = \frac{P(D \mid m)\,P(m)}{P(D)}$$

$$P(D \mid m) = \int P(D \mid \theta, m)\,P(\theta \mid m)\,d\theta$$

The term P(D|m) is the marginal likelihood or model evidence,
and implements a preference for simpler models known as
Bayesian Ockham’s razor.
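
The learning, prediction and model-comparison rules of Box 1 can be worked through on a very small problem. The sketch below is an illustrative assumption rather than an example from the Review: the data D are coin flips, model m0 is a fair coin with no free parameter, and model m1 has an unknown bias θ with a uniform prior. All function names are hypothetical. The posterior over θ is computed on a grid, and the evidence for m1 uses the closed form of the integral of θ^h (1 − θ)^t over [0, 1].

```python
import math


def evidence_fair(heads, tails):
    """P(D | m0): the coin is fair, so there is nothing to integrate over."""
    return 0.5 ** (heads + tails)


def evidence_unknown_bias(heads, tails):
    """P(D | m1): integral of theta^h (1 - theta)^t under a uniform prior."""
    n = heads + tails
    return math.factorial(heads) * math.factorial(tails) / math.factorial(n + 1)


def posterior_on_grid(heads, tails, grid_size=1001):
    """P(theta | D, m1) on a grid: likelihood times (uniform) prior, normalised."""
    thetas = [i / (grid_size - 1) for i in range(grid_size)]
    unnorm = [t ** heads * (1 - t) ** tails for t in thetas]
    z = sum(unnorm)
    return thetas, [u / z for u in unnorm]


def predictive_heads(heads, tails):
    """P(next flip is heads | D, m1): average theta over the posterior."""
    thetas, post = posterior_on_grid(heads, tails)
    return sum(t * p for t, p in zip(thetas, post))


def posterior_prob_m1(heads, tails, prior_m1=0.5):
    """P(m1 | D) from Bayes rule applied at the level of models."""
    e0 = evidence_fair(heads, tails) * (1 - prior_m1)
    e1 = evidence_unknown_bias(heads, tails) * prior_m1
    return e1 / (e0 + e1)


balanced = posterior_prob_m1(5, 5)  # data a fair coin explains well
skewed = posterior_prob_m1(9, 1)    # data a fair coin explains poorly
```

With balanced data the simpler model m0 receives most of the posterior probability even though m1 contains it as a special case, which is the Ockham's razor effect; with skewed data the preference reverses.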
[Figure 1 graphic: a directed graphical model over four binary variables. Node probabilities shown in the graphic:
P(gen pred = T) = 10⁻⁴
P(rare disease = T | gen pred = T) = 0.1
P(rare disease = T | gen pred = F) = 10⁻⁶
P(symptom = T | rare disease = T) = 0.8
P(symptom = T | rare disease = F) = 0.01
P(gen marker = T | gen pred = T) = 0.8
P(gen marker = T | gen pred = F) = 0.01]


Figure 1 | Bayesian inference. A simple example of Bayesian inference
applied to a medical diagnosis problem. Here the problem is diagnosing a rare
disease using information from the patient's symptoms and, potentially, the
patient's genetic marker measurements, which indicate predisposition (gen
pred) to this disease. In this example, all variables are assumed to be binary.
T, true; F, false. The relationships between variables are indicated by directed
arrows and the probability of each variable given other variables they directly
depend on is also shown. Yellow nodes denote measurable variables, whereas
green nodes denote hidden variables. Using the sum rule (Box 1), the prior
probability of the patient having the rare disease is: P(rare disease = T) = P(rare
disease = T | gen pred = T) P(gen pred = T) + P(rare disease = T | gen pred = F) P(gen
pred = F) = 1.1 × 10⁻⁵. Applying Bayes rule we find that for a patient observed
to have the symptom, the probability of the rare disease is: P(rare disease =
T | symptom = T) = 8.8 × 10⁻⁴, whereas for a patient observed to have the genetic
marker (gen marker) it is P(rare disease = T | gen marker = T) = 7.9 × 10⁻⁴.
Assuming that the patient has both the symptom and the genetic marker the
probability of the rare disease increases to P(rare disease = T | symptom = T,
gen marker = T) = 0.06. Here, we have shown fixed, known model parameters,
that is, the numbers θ = (10⁻⁴, 0.1, 10⁻⁶, 0.8, 0.01, 0.8, 0.01). However, both these
parameters and the structure of the model (the presence or absence of arrows
and additional hidden variables) could be learned from a data set of patient
records using the methods in Box 1.
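
The numbers quoted in the Figure 1 caption can be checked by brute-force enumeration of the sixteen joint configurations, using only the sum and product rules. The sketch below is one possible way to do this; the variable and function names are illustrative assumptions, and the exponents 10⁻⁴ and 10⁻⁶ are taken as the values consistent with the caption's stated results.

```python
from itertools import product

P_GEN_PRED = 1e-4                      # P(gen pred = T)
P_DISEASE = {True: 0.1, False: 1e-6}   # P(rare disease = T | gen pred)
P_SYMPTOM = {True: 0.8, False: 0.01}   # P(symptom = T | rare disease)
P_MARKER = {True: 0.8, False: 0.01}    # P(gen marker = T | gen pred)


def bern(p_true, value):
    """Probability that a binary variable with P(T) = p_true takes `value`."""
    return p_true if value else 1.0 - p_true


def joint(gen_pred, rare_disease, symptom, gen_marker):
    """Product rule applied along the arrows of the graphical model."""
    return (bern(P_GEN_PRED, gen_pred)
            * bern(P_DISEASE[gen_pred], rare_disease)
            * bern(P_SYMPTOM[rare_disease], symptom)
            * bern(P_MARKER[gen_pred], gen_marker))


def prob_disease(**evidence):
    """P(rare disease = T | evidence), summing the joint over everything else."""
    numerator = denominator = 0.0
    names = ("gen_pred", "rare_disease", "symptom", "gen_marker")
    for values in product([True, False], repeat=4):
        world = dict(zip(names, values))
        if any(world[k] != v for k, v in evidence.items()):
            continue  # inconsistent with what was observed
        p = joint(**world)
        denominator += p
        if world["rare_disease"]:
            numerator += p
    return numerator / denominator


prior = prob_disease()
given_symptom = prob_disease(symptom=True)
given_marker = prob_disease(gen_marker=True)
given_both = prob_disease(symptom=True, gen_marker=True)
```

Enumeration is feasible here only because there are four binary variables; the cost doubles with every variable added, which is the computational difficulty the main text goes on to discuss.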


there is no known polynomial time algorithm for performing them 
exactly. Fortunately, a number of approximate integration algorithms have 
been developed, including Markov chain Monte Carlo (MCMC) meth- 
ods, variational approximations, expectation propagation and sequential 
Monte Carlo. It is worth noting that computational techniques are one
area in which Bayesian machine learning differs from much of the rest 
of machine learning: for Bayesian researchers the main computational 
problem is integration, whereas for much of the rest of the community the 
focus is on optimization of model parameters. However, this dichotomy 
is not as stark as it appears: many gradient-based optimization meth- 
ods can be turned into integration methods through the use of Langevin 
and Hamiltonian Monte Carlo methods, while integration problems
can be turned into optimization problems through the use of variational
approximations. I revisit optimization in a later section.
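
The remark that gradient-based optimization can be turned into an integration method is easiest to see side by side. In the sketch below, gradient ascent on a log density climbs to its mode, whereas adding Gaussian noise to a half-sized gradient step (unadjusted Langevin dynamics) produces samples whose averages approximate integrals under that density. The target, a unit-variance Gaussian with mean 2, the step size and all names are assumptions chosen for illustration; no Metropolis correction is applied, so a small discretization bias remains.

```python
import math
import random


def grad_log_density(x, mean=2.0):
    """Gradient of log N(x | mean, 1), the assumed target density."""
    return -(x - mean)


def gradient_ascent(x, steps=200, step_size=0.1):
    """Plain optimization: climbs to the mode of the target."""
    for _ in range(steps):
        x += step_size * grad_log_density(x)
    return x


def langevin_samples(x, steps=20000, step_size=0.1, seed=0):
    """Unadjusted Langevin dynamics: half a gradient step plus Gaussian noise."""
    rng = random.Random(seed)
    samples = []
    for _ in range(steps):
        noise = rng.gauss(0.0, 1.0)
        x += 0.5 * step_size * grad_log_density(x) + math.sqrt(step_size) * noise
        samples.append(x)
    return samples


mode = gradient_ascent(0.0)
draws = langevin_samples(0.0)
sample_mean = sum(draws) / len(draws)
sample_var = sum((d - sample_mean) ** 2 for d in draws) / len(draws)
```

The optimizer returns a single point, while the sampler returns a cloud of points whose mean and variance approximate those of the target.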

The main modelling challenge for probabilistic machine learning is 
that the model should be flexible enough to capture all the properties of 
the data required to achieve the prediction task of interest. One approach 
to addressing this challenge is to develop a prior distribution that encom- 
passes an open-ended universe of models that can adapt in complexity to 
the data. The key statistical concept underlying flexible models that grow 
in complexity with the data is non-parametrics. 


Flexibility through non-parametrics 

One of the lessons of modern machine learning is that the best predic- 
tive performance is often obtained from highly flexible learning systems, 
especially when learning from large data sets. Flexible models can make 
better predictions because to a greater extent they allow data to ‘speak
for themselves’. (But note that all predictions involve assumptions and
therefore the data are never exclusively ‘speaking for themselves’.) There
are essentially two ways of achieving flexibility. The model could have 
a large number of parameters compared with the data set (for exam- 
ple, the neural network recently used to achieve translations of English 
and French sentences that approached the accuracy of state-of-the-art 
methods is a probabilistic model with 384 million parameters). Alternatively,
the model can be defined using non-parametric components.

The best way to understand non-parametric models is through com- 
parison to parametric ones. In a parametric model, there are a fixed, 
finite number of parameters, and no matter how much training data 
are observed, all the data can do is set these finitely many parameters 
that control future predictions. By contrast, non-parametric approaches 
have predictions that grow in complexity with the amount of training 
data, either by considering a nested sequence of parametric models with 
increasing numbers of parameters or by starting out with a model with 
infinitely many parameters. For example, in a classification problem, 
whereas a linear (parametric) classifier will always make predictions 
using a linear boundary between classes, a non-parametric classifier can 
learn a non-linear boundary whose shape becomes more complex with 
more data. Many non-parametric models can be derived starting from 
a parametric model and considering what happens as the model grows 
to the limit of infinitely many parameters. Clearly, fitting a model with
infinitely many parameters to finite training data would result in
‘overfitting’, in the sense that the model's predictions might reflect quirks
of the training data rather than regularities that can be generalized to 
test data. Fortunately, Bayesian approaches are not prone to this kind 
of overfitting since they average over, rather than fit, the parameters 
(Box 1). Moreover, for many applications we have such huge data sets 
that the main concern is underfitting from the choice of an overly sim- 
plistic parametric model, rather than overfitting. 

A full discussion of Bayesian non-parametrics is outside the scope of 
this Review (see refs 9, 31, 32 for this), but it is worth mentioning a few 
of the key models. Gaussian processes are a very flexible non-parametric
model for unknown functions, and are widely used for regression,
classification, and many other applications that require inference on
functions. Consider learning a function that relates the dose of some
chemical to the response of an organism to that chemical. Instead of
modelling this relationship with, say, a linear parametric function, a
Gaussian process could be used to learn directly a non-parametric
distribution of non-linear functions consistent with the data. A notable
example of a recent application of Gaussian processes is GaussianFace, a
state-of-the-art approach to face recognition that outperforms humans
and deep-learning methods. Dirichlet processes are a non-parametric
model with a long history in statistics and are used for density
estimation, clustering, time-series analysis and modelling the topics
of documents. To illustrate Dirichlet processes, consider an application
to modelling friendships in a social network, where each person
can belong to one of many communities. A Dirichlet process makes it
possible to have a model whereby the number of inferred communities
(that is, clusters) grows with the number of people. Dirichlet processes
have also been used for clustering gene-expression patterns. The
Indian buffet process (IBP) is a non-parametric model that can be
used for latent feature modelling, learning overlapping clusters, sparse
matrix factorization, or to non-parametrically learn the structure of
a deep network. Elaborating the social network modelling example,
an IBP-based model allows each person to belong to some subset of a
large number of potential communities (for example, as defined by
different families, workplaces, schools, hobbies, and so on) rather than a
single community, and the probability of friendship between two people
depends on the number of overlapping communities they have. In this
case, the latent features of each person correspond to the communities,
which are not assumed to be observed directly. The IBP can be thought
of as a way of endowing Bayesian non-parametric models with
‘distributed representations’, as popularized in the neural network literature.
An interesting link between Bayesian non-parametrics and neural
networks is that, under fairly general conditions, a neural network with
statesmean = [-1, 1, 0]  # Emission parameters.
initial = Categorical([1.0/3, 1.0/3, 1.0/3])  # Prob distr of state[1].
trans = [Categorical([0.1, 0.5, 0.4]), Categorical([0.2, 0.2, 0.6]),
         Categorical([0.15, 0.15, 0.7])]  # Trans distr for each state.
data = [Nil, 0.9, 0.8, 0.7, 0, -0.025, -5, -2, -0.1, 0, 0.13]

@model hmm begin  # Define a model hmm.
  states = Array(Int, length(data))
  @assume(states[1] ~ initial)
  for i = 2:length(data)
    @assume(states[i] ~ trans[states[i-1]])
    @observe(data[i] ~ Normal(statesmean[states[i]], 0.4))
  end
  @predict states
end
Figure 2 | Probabilistic programming. A probabilistic program in Julia 
(left) defining a simple three-state hidden Markov model (HMM), inspired 
by an example in ref. 62. The HMM is a widely used probabilistic model for
sequential and time-series data, which assumes the data were obtained by
transitioning stochastically between a discrete number of hidden states.
The first four lines define the model parameters and the data. Here ‘trans’ is
the 3 × 3 state-transition matrix, ‘initial’ is the initial state distribution, and
‘statesmean’ are the mean observations for each of the three states; actual
observations are assumed to be noisy versions of this mean with Gaussian 
noise. The function hmm starts the definition of the HMM, drawing the 


infinitely many hidden units is equivalent to a Gaussian process. Note
that the above non-parametric components should be thought of again 
as building blocks, which can be composed into more complex models 
as described earlier. The next section describes an even more power- 
ful way of composing models — through probabilistic programming. 
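
The way a Dirichlet process lets the number of communities grow with the number of people can be simulated through the Chinese restaurant process, the standard sequential description of the clustering a Dirichlet process induces. The sketch below is an illustration under assumed settings (concentration parameter alpha = 1, 1,000 people); the function name is hypothetical.

```python
import random


def chinese_restaurant_process(n_people, alpha=1.0, seed=0):
    """Assign people to communities one at a time.

    Each new person joins an existing community with probability proportional
    to its current size, or starts a new one with probability proportional to
    alpha. Returns the final community sizes and the number of communities
    after each person has been assigned.
    """
    rng = random.Random(seed)
    sizes = []
    history = []
    for i in range(n_people):
        # Total weight: i people already assigned, plus alpha for a new community.
        r = rng.random() * (i + alpha)
        cumulative = 0.0
        for k, size in enumerate(sizes):
            cumulative += size
            if r < cumulative:
                sizes[k] += 1
                break
        else:
            sizes.append(1)  # nobody matched: open a new community
        history.append(len(sizes))
    return sizes, history


sizes, history = chinese_restaurant_process(1000)
```

The number of communities is never fixed in advance; its expected value grows roughly logarithmically with the number of people, so more data can always reveal more structure.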


Probabilistic programming 

The basic idea in probabilistic programming is to use computer programs
to represent probabilistic models (http://probabilistic-programming.org).
One way to do this is for the computer program to define
a generator for data from the probabilistic model, that is, a simulator 
(Fig. 2). This simulator makes calls to a random number generator in 
such a way that repeated runs from the simulator would sample different 
possible data sets from the model. This simulation framework is more 
general than the graphical model framework described previously since 
computer programs can allow constructs such as recursion (functions 
calling themselves) and control flow statements (for example, ‘if’ state- 
ments that result in multiple paths a program can follow), which are 
difficult or impossible to represent in a finite graph. In fact, for many 
of the recent probabilistic programming languages that are based on 
extending Turing-complete languages (a class that includes almost all 
commonly used languages), it is possible to represent any computable 
probability distribution as a probabilistic program.

The full potential of probabilistic programming comes from automat- 
ing the process of inferring unobserved variables in the model condi- 
tioned on the observed data (Box 1). Conceptually, conditioning needs 
to compute input states of the program that generate data matching the 
observed data. Whereas normally we think of programs running from 
inputs to outputs, conditioning involves solving the inverse problem of 
inferring the inputs (in particular the random number calls) that match 
a certain program output. Such conditioning is performed by a ‘universal
inference engine’, usually implemented by Monte Carlo sampling
over possible executions of the simulator program that are consistent
with the observed data. The fact that defining such universal inference
algorithms for computer programs is even possible is somewhat surprising,
but it is related to the generality of certain key ideas from sampling
such as rejection sampling, sequential Monte Carlo methods and
‘approximate Bayesian computation’.
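
Rejection sampling, the simplest of the ideas just listed, shows how a simulator can be conditioned without deriving any model-specific inference algorithm. The sketch below is an illustrative toy, not one of the systems cited: the program randomly decides whether to roll one die or two, and inference asks how many dice were rolled given that the total was 4. The simulator and all names are assumptions made for this example.

```python
import random


def simulator(rng):
    """A tiny generative program with control flow on a random choice."""
    n_dice = 1 if rng.random() < 0.5 else 2                 # latent variable
    total = sum(rng.randint(1, 6) for _ in range(n_dice))   # observable output
    return n_dice, total


def infer_by_rejection(observed_total, n_runs=80000, seed=0):
    """Estimate P(n_dice = 1 | total = observed_total).

    Runs the simulator many times and keeps only the executions whose output
    matches the observation; the kept latent values are posterior samples.
    """
    rng = random.Random(seed)
    accepted = [n for n, total in (simulator(rng) for _ in range(n_runs))
                if total == observed_total]
    return sum(1 for n in accepted if n == 1) / len(accepted)


estimate = infer_by_rejection(observed_total=4)
# Bayes rule by hand for comparison: one die gives 4 with probability 1/6,
# two dice give 4 with probability 3/36.
exact = (0.5 / 6) / (0.5 / 6 + 0.5 * 3 / 36)
```

The inference routine never inspects how the simulator works, which is why the same engine could serve any program. The price is efficiency: most runs are discarded, and the waste grows quickly as observations become more specific.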

As an example, imagine you write a probabilistic program that simu- 
lates a gene regulatory model that relates unmeasured transcription 


sequence of states with the @assume statements, and conditioning on the 
observed data with the @observe statements. Finally @predict states that we 
wish to infer the states and data; this inference is done automatically by the 
universal inference engine, which reasons over the configurations of this 
computer program. It would be trivial to modify this program so that the 
HMM parameters are unknown rather than fixed. A graphical model (right) 
corresponding to the HMM probabilistic program showing dependencies 
between the parameters (blue), hidden state variables (green) and observed 
data (yellow). This graphical model highlights the compositional nature of 
probabilistic models. 


factors to the expression levels of certain genes. Your uncertainty in each 
part of the model would be represented by the probability distributions 
used in the simulator. The universal inference engine can then condition 
the output of this program on the measured expression levels, and auto- 
matically infer the activity of the unmeasured transcription factors and 
other uncertain model parameters. Another application of probabilistic 
programming implements a computer vision system as the inverse of a 
computer graphics program.

There are several reasons why probabilistic programming could 
prove to be revolutionary for machine intelligence and scientific mod- 
elling (its potential has been noticed by US Defense Advanced Research 
Projects Agency, which is currently funding a major programme called 
Probabilistic Programming for Advancing Machine Learning). First, 
the universal inference engine obviates the need to manually derive 
inference methods for models. Since deriving and implementing 
inference methods is generally the most rate-limiting and bug-prone 
step in modelling, often taking months, automating this step so that 
it takes minutes or seconds will greatly accelerate the deployment of 
machine learning systems. Second, probabilistic programming could 
be potentially transformative for the sciences, since it allows for rapid 
prototyping and testing of different models of data. Probabilistic
programming languages create a very clear separation between the model
and the inference procedures, encouraging model-based thinking.
There are a growing number of probabilistic programming languages.
BUGS, Stan, AutoBayes and Infer.NET allow only a restrictive
class of models to be represented compared with systems based on
Turing-complete languages. In return for this restriction, inference
in such languages can be much faster than for the more general
languages such as IBAL, BLOG, Church, Figaro, Venture, and
Anglican. A major emphasis of recent work is on fast inference in
general languages (see for example ref. 63). Nearly all approaches to
probabilistic programming are Bayesian since it is hard to create other
coherent frameworks for automated reasoning about uncertainty.
Notable exceptions are systems such as Theano, which is not itself a
probabilistic programming language but uses symbolic differentiation
to speed up and automate optimization of parameters of neural
networks and other probabilistic models.

Although parameter optimization is commonly used to improve 
probabilistic models, in the next section I will describe recent work 
on how probabilistic modelling can be used to improve optimization. 
Figure 3 | Bayesian optimization. A simple illustration of Bayesian
optimization in one dimension. The goal is to maximize some true 
unknown function f (not shown). Information about this function is 
gained by making observations (circles, top), which are evaluations of 

the function at specific x values. These observations are used to infer a 
posterior distribution over the function values (shown as mean, blue line; 
and standard deviations, blue shaded area) representing the distribution of 
possible functions; note that uncertainty grows away from the observations. 
On the basis of this distribution over functions, an acquisition function is 


Bayesian optimization 

Consider the very general problem of finding the global maximum of an
unknown function that is expensive to evaluate (say, evaluating the function
requires performing lots of computation or conducting an experiment).
Mathematically, for a function f on a domain X, the goal is to find
a global maximizer x*:

$$x^{*} = \arg\max_{x \in X} f(x).$$


Bayesian optimization poses this as a problem in sequential deci- 
sion theory: where should one evaluate next so as most quickly to 
maximize f, taking into account the gain in information about the 
unknown function f (refs 65,66)? For example, having evaluated at 
three points, measuring the corresponding values of the function at 
those points, [(x₁, f(x₁)), (x₂, f(x₂)), (x₃, f(x₃))], which point x should
the algorithm evaluate next, and where does it believe the maximum 
to be? This is a classic machine-intelligence problem with a wide 
range of applications in science and engineering, from drug design 
(where the function could be the drug’s efficacy) to robotics (where 
the function could be the speed of a robot’s gait). It can be applied 
to any problem involving the optimization of expensive functions; 
the qualifier ‘expensive’ comes because Bayesian optimization might
use substantial computational resources to decide where to evaluate 
next, and a trade-off for these resources has to be made with the cost 
of function evaluations. 

The current best-performing global optimization methods maintain a 
Bayesian representation of the probability distribution over the uncertain 
function f being optimized, and use this uncertainty to decide where 
(in X) to query next. In continuous spaces, most Bayesian optimization
methods (Fig. 3) use Gaussian processes (as described in the section
on non-parametrics) to model the unknown function. A recent high-impact
application has been the optimization of the training process for
machine-learning models, including deep neural networks. This and
related recent work are further examples of the application of machine
intelligence to improve machine intelligence. 
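
The loop described in this section (fit a Gaussian process to the evaluations so far, score candidate points with an acquisition function, evaluate the best candidate, repeat) can be sketched in a few dozen lines. The version below is a minimal illustration under stated assumptions, not the method of any cited system: a zero-mean Gaussian process with a squared-exponential covariance of fixed length scale, the ‘expected improvement’ acquisition function, a grid of 101 candidates on [0, 1], and a cheap stand-in objective with its maximum at x = 0.7. All names are hypothetical.

```python
import math


def rbf(a, b, length=0.3):
    """Squared-exponential covariance between two inputs."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def gp_posterior(xs, ys, x_new, noise=1e-4):
    """Posterior mean and standard deviation of a zero-mean GP at x_new."""
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(xs)]
         for i, a in enumerate(xs)]
    k = [rbf(a, x_new) for a in xs]
    mean = sum(ki * wi for ki, wi in zip(k, solve(K, ys)))
    var = rbf(x_new, x_new) - sum(ki * vi for ki, vi in zip(k, solve(K, k)))
    return mean, math.sqrt(max(var, 0.0))


def expected_improvement(mean, std, best):
    """Expected amount by which a new evaluation exceeds the best value so far."""
    if std == 0.0:
        return max(mean - best, 0.0)
    z = (mean - best) / std
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    return (mean - best) * cdf + std * pdf


def bayes_opt(f, n_iter=8):
    """Sequentially choose where to evaluate f, starting from three points."""
    grid = [i / 100 for i in range(101)]
    xs = [0.0, 0.5, 1.0]
    ys = [f(x) for x in xs]
    for _ in range(n_iter):
        best = max(ys)
        candidates = [x for x in grid if x not in xs]
        x_next = max(candidates,
                     key=lambda x: expected_improvement(*gp_posterior(xs, ys, x), best))
        xs.append(x_next)
        ys.append(f(x_next))
    i_best = max(range(len(xs)), key=lambda i: ys[i])
    return xs[i_best], ys[i_best]


def objective(x):
    """Stand-in for an expensive function; its maximum is at x = 0.7."""
    return -(x - 0.7) ** 2


x_best, y_best = bayes_opt(objective)
```

Each iteration spends far more arithmetic on deciding where to look than on the evaluation itself, which is only worthwhile when, unlike this stand-in objective, a function evaluation is genuinely costly.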

There are interesting links between Bayesian optimization and rein- 
forcement learning. Specifically, Bayesian optimization is a sequential 
decision problem where the decisions (choices of x to evaluate) do 
not affect the state of the system (the actual function f). Such state-less 
sequential decision problems fall under the rubric of multi-arm bandits,
computed (green shaded area, bottom panels), which represents the gain
from evaluating the unknown function f at different x values; note that the
acquisition function is high where the posterior over f has both high mean
and large uncertainty. Different acquisition functions can be used such as
‘expected improvement’ or ‘information gain’. The peak of the acquisition
function (red line) is the best next point to evaluate, and is therefore chosen 
for evaluation (red dot, new observation). The left and right panels show 
an example of what could happen after three and four function evaluations, 
respectively. 


a subclass of reinforcement-learning problems. More broadly, important 
recent work takes a Bayesian approach to learning to control uncertain 
systems (for a review see ref. 74). Faithfully representing uncertainty
about the future outcome of actions is particularly important in decision 
and control problems. Good decisions rely on good representations of 
the probability of different outcomes and their relative payoffs. 
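
For the state-less bandit setting mentioned above, one widely used Bayesian strategy is Thompson sampling. It is not discussed in the text and is named here only as an illustrative choice. The sketch assumes three arms with Bernoulli payoffs and a uniform Beta(1, 1) prior on each arm's success rate; the true rates and all names are hypothetical.

```python
import random


def thompson_sampling(true_rates, n_rounds=2000, seed=0):
    """Bernoulli bandit with a Beta(1, 1) prior on each arm's success rate."""
    rng = random.Random(seed)
    wins = [0] * len(true_rates)
    losses = [0] * len(true_rates)
    for _ in range(n_rounds):
        # Draw one plausible success rate per arm from its current posterior.
        draws = [rng.betavariate(1 + w, 1 + l) for w, l in zip(wins, losses)]
        arm = draws.index(max(draws))
        # Pull the chosen arm; the outcome updates beliefs, not the arms.
        if rng.random() < true_rates[arm]:
            wins[arm] += 1
        else:
            losses[arm] += 1
    pulls = [w + l for w, l in zip(wins, losses)]
    return pulls, wins


pulls, wins = thompson_sampling([0.2, 0.5, 0.8])
```

Arms whose posterior is still broad are occasionally sampled as the best and therefore tried, while arms known to be poor are rarely chosen, so the representation of uncertainty directly drives the balance between exploring and exploiting.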

More generally, Bayesian optimization is a special case of Bayesian 
numerical computation, which is re-emerging as a very active area of
research (http://www.probabilistic-numerics.org), and includes topics 
such as solving ordinary differential equations and numerical integration. 
In all these cases, probability theory is being used to represent computa- 
tional uncertainty; that is, the uncertainty that one has about the outcome 
of a deterministic computation. 


Data compression 

Consider the problem of compressing data so as to communicate them 
or store them in as few bits as possible in such a manner that the original 
data can be recovered exactly from the compressed data. Methods for such 
lossless data compression are ubiquitous in information technology, from 
computer hard drives to data transfer over the internet. Data compression 
and probabilistic modelling are two sides of the same coin, and Bayesian 
machine-learning methods are increasingly advancing the state-of-the-art 
in compression. The connection between compression and probabilistic 
modelling was established in the mathematician Claude Shannon's seminal
work on the source coding theorem, which states that the number
of bits required to compress data in a lossless manner is bounded below by
the entropy of the probability distribution of the data. All commonly used
lossless data compression algorithms (for example, gzip) can be viewed 
as probabilistic models of sequences of symbols. 

The link to Bayesian machine learning is that the better the 
probabilistic model one learns, the higher the compression rate 
can be. These models need to be flexible and adaptive, since different
kinds of sequences have very different statistical patterns
(say, Shakespeare’s plays or computer source code). It turns out
that some of the world’s best compression algorithms (for example,
Sequence Memoizer and PPM with dynamic parameter updates)
are equivalent to Bayesian non-parametric models of sequences, 
and improvements to compression are being made through a better 
understanding of how to learn the statistical structure of sequences. 
Future advances in compression will come with advances in 
probabilistic machine learning, including special compression
methods for non-sequence data such as images, graphs and other 
structured objects. 
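
The link between model quality and compressed size can be made concrete by computing ideal code lengths: a symbol predicted with probability p costs −log₂ p bits. The sketch below compares a model that treats all symbols as equally likely with a simple adaptive Bayesian model, a Dirichlet(1, …, 1) prior updated after every symbol. This is a deliberately small stand-in, not one of the algorithms cited above, and the example sequence and function names are assumptions.

```python
import math
from collections import Counter


def uniform_code_length(text, alphabet):
    """Bits needed when every symbol is assumed equally likely."""
    return len(text) * math.log2(len(alphabet))


def adaptive_code_length(text, alphabet):
    """Bits needed under a sequentially updated Dirichlet(1, ..., 1) model.

    Each symbol costs -log2 of its predictive probability given the symbols
    already seen.
    """
    counts = Counter()
    bits = 0.0
    for seen, symbol in enumerate(text):
        p = (counts[symbol] + 1) / (seen + len(alphabet))
        bits -= math.log2(p)
        counts[symbol] += 1
    return bits


def empirical_entropy_bits(text):
    """Length of the text times the entropy of its symbol frequencies."""
    n = len(text)
    return -sum(c * math.log2(c / n) for c in Counter(text).values())


text = "a" * 90 + "b" * 10
alphabet = "ab"
uniform_bits = uniform_code_length(text, alphabet)
adaptive_bits = adaptive_code_length(text, alphabet)
entropy_bits = empirical_entropy_bits(text)
```

On this skewed sequence the adaptive model needs roughly half the bits of the uniform one and comes close to the entropy of the symbol frequencies, without being told those frequencies in advance.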


Automatic discovery of interpretable models from data 

One of the grand challenges of machine learning is to fully automate the 
process of learning and explaining statistical models from data. This is the 
goal of the Automatic Statistician (http://www.automaticstatistician.com), 
a system that can automatically discover plausible models from data, and 
explain what it has discovered in plain English. This could be useful
to almost any field of endeavour that is reliant on extracting knowledge 
from data. In contrast to the methods described in much of the machine- 
learning literature, which have been focused on extracting increasing 
performance improvements on pattern-recognition problems using
techniques such as kernel methods, random forests or deep learning, the 
Automatic Statistician builds models that are composed of interpretable 
components, and has a principled way of representing uncertainty about 
model structures given the data. It also gives reasonable answers not just 
for big data sets, but also for small ones. Bayesian approaches provide an 
elegant way of trading off the complexity of the model and the complexity 
of the data, and probabilistic models are compositional and interpretable, 
as already described. 

A prototype version of the Automatic Statistician takes in time-series 
data and automatically generates 5-15 page reports describing the model 
it has discovered (Fig. 4). This system is based on the idea that proba- 
bilistic building blocks can be combined through a grammar to build 


[Figure 4 graphic. Excerpt from an example report: ‘This component is approximately periodic with a period of 10.7 years. Across periods the shape of this function varies smoothly with a typical length scale of 34.4 years. The shape of this function within each period is very smooth and resembles a sinusoid. This component applies until 1643 and from 1716 onwards. This component explains 72.4% of the residual variance; this increases the total variance explained from 73.0% to 92.5%. The addition of this component reduces the cross validated MAE by 16.67% from 0.18 to 0.15.’ Panel title: ‘Sum of components up to component 4’. Model-grammar labels: SE, PER, Lin + Per, Lin × Per.]

Figure 4 | The Automatic Statistician. A flow diagram describing the
Automatic Statistician. The input to the system is data, in this case
represented as time series (top left). The system searches over a grammar
of models to discover a good interpretation of the data (bottom left), using
Bayesian inference to score the models (Box 1). Components of the model
discovered are translated into English phrases (bottom right). The end
result is a report with text, figures and tables, describing in detail what has
been inferred about the data, including a section on model checking and
criticism (top right). Data and the report are for illustrative purposes
only.
[Caption of the report excerpt shown in the graphic: ‘Figure 8: Pointwise posterior of component 4 (left) and the posterior of the cumulative sum of components with data (right)’.]
an open-ended language of models. In contrast to work on equation
learning (see for example ref. 83), the models attempt to capture general 
properties of functions (for example, smoothness, periodicity or trends) 
rather than a precise equation. Handling uncertainty is at the core of the 
Automatic Statistician; it makes use of Bayesian non-parametrics to give 
it the flexibility to obtain state-of-the-art predictive performance, and 
uses the metric marginal likelihood (Box 1) to search the space of models. 

Important earlier work includes statistical expert systems and the
Robot Scientist, which integrates machine learning and scientific discovery
in a closed loop with an experimental platform in microbiology to
automate the design and execution of new experiments. Auto-WEKA
is a recent project that automates learning classifiers, making heavy use of
the Bayesian optimization techniques already described. Efforts to automate
the application of machine-learning methods to data have recently
gained momentum, and may ultimately result in artificial intelligence 
systems for data science. 


Perspective 
The information revolution has resulted in the availability of ever larger 
collections of data. What is the role of uncertainty in modelling such big 
data? Classic statistical results state that under certain regularity conditions,
in the limit of large data sets the posterior distribution of the parameters
for Bayesian parametric models converges to a single point around
the maximum likelihood estimate. Does this mean that Bayesian probabilistic
modelling of uncertainty is unnecessary if you have a lot of data?
There are at least two reasons this is not the case. First, as we have
seen, Bayesian non-parametric models have essentially infinitely many 
parameters, so no matter how many data one has, their capacity to learn 
should not saturate, and their predictions should continue to improve. 
Second, many large data sets are in fact large collections of small data 
sets. For example, in areas such as personalized medicine and recom- 
mendation systems, there might be a large amount of data, but there is still 
a relatively small amount of data for each patient or client, respectively. 
To customize predictions for each person it becomes necessary to build a
model for each person — with its inherent uncertainties — and to couple 
these models together in a hierarchy so that information can be borrowed 
from other similar people. We call this the personalization of models, and 
it is naturally implemented using hierarchical Bayesian approaches such 
as hierarchical Dirichlet processes, and Bayesian multitask learning.
Probabilistic approaches to machine learning and intelligence are a very 
active area of research with wide-ranging impact beyond conventional 
pattern-recognition problems. As I have outlined, these problems include 
data compression, optimization, decision making, scientific model dis- 
covery and interpretation, and personalization. The key distinction 
between problems in which a probabilistic approach is important and 
problems that can be solved using non-probabilistic machine-learning 
approaches is whether uncertainty has a central role. Moreover, most 
conventional optimization-based machine-learning approaches have 
probabilistic analogues that handle uncertainty in a more principled 
manner. For example, Bayesian neural networks represent the parameter 
uncertainty in neural networks, and mixture models are a probabilistic 
analogue for clustering methods. Although probabilistic machine learn- 
ing often defines how to solve a problem in principle, the central chal- 
lenge in the field is finding how to do so in practice in a computationally 
efficient manner. There are many approaches to the efficient approxi- 
mation of computationally hard inference problems. Modern inference 
methods have made it possible to scale to millions of data points, making 
probabilistic methods computationally competitive with conventional 
methods. Ultimately, intelligence relies on understanding and acting 
in an imperfectly sensed and uncertain world. Probabilistic modelling will 
continue to play a central part in the development of ever more powerful 
machine learning and artificial intelligence systems. 

Science, technology and the future of small autonomous drones 


Dario Floreano & Robert J. Wood 


We are witnessing the advent of a new era of robots — drones — that can autonomously fly in natural and man-made 
environments. These robots, often associated with defence applications, could have a major impact on civilian tasks, 
including transportation, communication, agriculture, disaster mitigation and environment preservation. Autonomous 
flight in confined spaces presents great scientific and technical challenges owing to the energetic cost of staying airborne 
and to the perceptual intelligence required to negotiate complex environments. We identify scientific and technological 
advances that are expected to translate, within appropriate regulatory frameworks, into pervasive use of autonomous 
drones for civilian applications. 


Drones, a popular nickname for unmanned aerial vehicles and 
micro aerial vehicles, often conjure up images of unmanned aeroplanes 
that fly thousands of miles for espionage and to deploy 
munitions. However, over the past few years, an increasing number of 
public and private research laboratories have been working on small, 
human-friendly drones that one day may autonomously fly in confined 
spaces and in close proximity to people. The development of these small 
drones, which is the main focus of this Review, has been supported by the 
miniaturization and cost reduction of electronic components (micro- 
processors, sensors, batteries and wireless communication units), largely 
driven by the portable electronic device industry. These improvements 
have enabled the prototyping and commercialization of small (typically 
less than 1 kg) drones at smartphone prices. 

Small drones will have important socio-economic impacts (Fig. 1). 
Images from drones that are capable of flying a few metres above the 
ground will fill a gap between expensive, weather-dependent and low- 
resolution images provided by satellites and car-based images limited to 
human-level perspectives and the availability of accessible roads. Special- 
ized flying cameras and cloud-based data analytics will allow farmers to 
continuously monitor the quality of crop growth. Such platforms will 
enable construction companies to measure work progress in real time. 
Drones will let mining companies obtain precise volumetric data of exca- 
vations. Energy and infrastructure companies will be able to exhaustively 
survey pipelines, roads and cables. Humanitarian organizations could 
immediately assess and adapt aid efforts in continuously changing refugee 
camps. Transportation drones that are capable of safely taking off and 
landing in the proximity of buildings and humans will allow developing 
countries that lack a suitable road network to rapidly deliver goods 
and to finally unleash the full potential of their e-commerce telecommu- 
nication infrastructure. Transportation drones will also help developed 
countries to improve the quality of service in congested or remote areas, 
and will enable rescue organizations to quickly deliver medical supplies 
in the field and on demand. Inspection drones that are capable of flying in 
confined spaces will help fire-fighting and emergency units to assess dan- 
gers faster and more safely, logistic companies to detect cracks in the inner 
and outer shells of ships, road maintenance companies to measure signs 
of wear and tear in bridges and tunnels, security companies to improve 
building safety by monitoring areas outside the range of surveillance 
cameras, and disaster mitigation agencies to inspect partially collapsed 
buildings where ground clutter is an obstacle for terrestrial robots. Coor- 
dinated teams of autonomous drones will enable missions that last longer 
than the flight time of a single drone by allowing some drones to tempo- 
rarily leave the team for battery replacement. Drone teams will permit res- 
cue organizations to quickly deploy dedicated communication networks 
for ground operators. Telecommunication companies can also leverage 
drone networks to temporarily supplement or replace points of service. 
Realizing this vision will require new mechatronic solutions and new 
levels of control autonomy (Box 1) to safely complete missions in con- 
fined spaces and near the ground, where absolute positioning signals and 
remote control are not available or sufficiently precise. None of today’s 
commercial drones has sufficient control autonomy to complete any of 
the missions described without skilled human supervision, which makes 
those operations slow, dangerous and not scalable. We identify the most 
promising scientific and technological advances that could lead to a new 
generation of small autonomous drones and offer a tentative road map of 
capability deployment within suitable regulatory frameworks. 


Design and manufacturing challenges 

Flight is energetically expensive, particularly when the size of the device 
is reduced. This is often due to practical issues that arise when scal- 
ing a vehicle down, such as reduced power density of electromagnetic 
motors, decreased transmission efficiency owing to increased domi- 
nance of friction from gears and bearings, and greater viscous losses 
because of reduced Reynolds numbers. Certain flight modes are also 
challenging — for example, hovering — either owing to energetics or 
control; these also become increasingly difficult as size is reduced. Scaling 
issues create problems at the most basic level of autonomy: they limit the 
ability to simply sustain flight for an adequate amount of time to perform 
higher-level mission functions. To overcome these challenges, developers 
of drones must consider system-level design choices that balance trade- 
offs that arise when selecting the constituent components to power the 
vehicle. The most common morphologies for flying robots include more 
conventional fixed-wing and rotary-wing aircraft (rotorcraft) designs as 
well as bio-inspired designs based on flapping wings. Each of these plat- 
form types has pros and cons. For example, fixed-wing aircraft (Fig. 2a) are 
capable of fast and efficient flight, but typically cannot hover. Rotorcraft 
(Fig. 2b) can hover and are highly manoeuvrable, but are generally less 
efficient in forward flight than fixed-wing vehicles. Neither fixed-wing 
craft nor rotorcraft scale down well, both in terms of the aerodynamics 
that govern flight and in the performance of the components that are nec- 
essary to generate propulsion. Bio-inspired flapping-wing flight (Fig. 2c) 
offers an alternative paradigm that can be scaled down in size, but brings 
fluid-mechanics-modelling and control challenges. 

Figure 1 | Autonomous drones in rescue situations. a, Fixed-wing drones with a long flight time could provide bird’s-eye-view images and a communication 
network for rescuers on the ground. b, Rotorcrafts with hovering capabilities could inspect structures for cracks and leaks; and c, transport medical supplies from 
nearby hospitals. d, Swarms of dispensable drones with flapping wings could enter buildings to search for chemical hazards. e, Multi-modal caged robots could fly 
and roll into complex structures to safely search for signs of life. 
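
The drop in Reynolds number mentioned in the design discussion above can be made concrete with the standard definition Re = ρvL/μ. The sketch below is only an order-of-magnitude illustration: the air properties are typical sea-level values, and the two operating points (speed and characteristic length) are assumed for the example and are not taken from this Review.

```python
def reynolds_number(speed_m_s, length_m, density=1.225, viscosity=1.81e-5):
    """Re = rho * v * L / mu, with defaults for air near sea level and 15 degrees C."""
    return density * speed_m_s * length_m / viscosity


# Assumed, illustrative operating points (hypothetical values).
light_aircraft = reynolds_number(speed_m_s=60.0, length_m=1.5)   # wing chord ~1.5 m
insect_scale = reynolds_number(speed_m_s=1.0, length_m=0.003)    # wing chord ~3 mm
```

Under these assumptions the two regimes differ by roughly four orders of magnitude, which is why viscous effects matter so much more at insect scale.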


Propulsion and manoeuvrability considerations 
Small-scale aerial vehicles are predominantly fixed-wing propeller-driven 
aircraft or rotorcraft. The latter can be further broken down into con- 
ventional primary and tail rotor configurations, coaxial dual rotors, or 
quadcopter designs, which have recently become popular. For each of 
these configurations, and for propeller-driven aircraft at larger scales, 
there have been only small incremental improvements to propeller design 
over the past several decades, since the advent of standards and tools to 
generate airfoil shapes by the National Advisory Committee for Aeronautics. 
Propulsive efficiencies for rotorcraft degrade as the vehicle size is 
reduced, an indicator of the energetic challenges for flight at small scales. 
Smaller size typically implies lower Reynolds numbers, which in turn 
suggests an increased dominance of viscous forces, causing greater drag 
coefficients and reduced lift coefficients compared with larger aircraft. 
To put this into perspective, this means that a scaled-down fixed-wing 
aircraft would be subject to a lower lift-to-drag ratio and thereby require 
greater relative forward velocity to maintain flight, with the associated 
drag and power penalty reducing the overall energetic efficiency. The 
impacts of scaling challenges (Fig. 3) are that smaller drones have less 
endurance, and that the overall flight times range from tens of seconds to 
tens of minutes — unfavourable compared with human-scale vehicles. 
There are, however, manoeuvrability benefits that arise from decreased 
vehicle size. For example, the moment of inertia is a strong function of 
the vehicle's characteristic dimension — a measure of a critical length 
of the vehicle, such as the chord length of a wing or length of a propeller 
in a similar manner as used in Reynolds number scaling. Because the 
moment of inertia of the vehicle scales with the characteristic dimension, 
L, raised to the fifth power, a decrease in size from an 11 m wingspan, four- 
seat aircraft such as the Cessna 172 to a 0.05 m rotor-to-rotor separation 
Blade Pico QX quadcopter implies that the Cessna has about 5 × 10¹¹ times the 
inertia of the quadcopter (with respect to roll). Depending on the scaling 
law used for torque generated by the quadcopter, this leads to angular 
acceleration that scales as either L⁻¹ or L⁻², both cases indicating that 
the quadcopter will be much more manoeuvrable. This has resulted in 
remarkable demonstrations of aerobatic manoeuvres in rooms equipped 
with real-time tracking and control by external computers, including fly- 
ing through windows at high speeds and manipulating objects in flight. 
This enhanced agility, often achieved at the expense of open-loop stability, 
requires increased emphasis on control — a challenge also exacerbated 
by the size, weight and power constraints of these small vehicles. Imple- 
menting these behaviours in autonomous drones (with on-board sensing 
and computation) is a significant challenge, but one that is beginning to 
have promising results, even for aggressive flight manoeuvres. The 
challenges for further scaling down these sensing and control systems are 
discussed in the final section of this Review. 
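
The manoeuvrability argument in this section rests on simple power-law scaling, which can be checked numerically. The sketch below assumes geometric similarity (moment of inertia proportional to the fifth power of the characteristic length) and uses the two lengths quoted in the text; the function names are hypothetical.

```python
def inertia_ratio(l_large, l_small):
    """Ratio of moments of inertia for geometrically similar vehicles (I ~ L**5)."""
    return (l_large / l_small) ** 5


def angular_acceleration_gain(l_large, l_small, exponent):
    """Factor by which the small vehicle's angular acceleration exceeds the
    large one's when angular acceleration scales as L**exponent."""
    return (l_small / l_large) ** exponent


CESSNA_WINGSPAN_M = 11.0          # characteristic length quoted in the text
QUADCOPTER_ROTOR_SEP_M = 0.05     # characteristic length quoted in the text

ratio = inertia_ratio(CESSNA_WINGSPAN_M, QUADCOPTER_ROTOR_SEP_M)
gain_inverse = angular_acceleration_gain(CESSNA_WINGSPAN_M, QUADCOPTER_ROTOR_SEP_M, -1)
gain_inverse_square = angular_acceleration_gain(CESSNA_WINGSPAN_M, QUADCOPTER_ROTOR_SEP_M, -2)
```

The length ratio is 220, so the inertia ratio is 220 to the fifth power (about 5 × 10¹¹), and the angular-acceleration advantage of the quadcopter is a factor of about 220 or about 48,400 depending on which of the two scaling laws is assumed.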


Actuation, power and manufacturing 

Beyond autonomy considerations, the size and type of the flying robot 
also engender associated challenges with actuation and manufacturing. 
For actuation, rotorcraft and propeller-driven fixed-wing drones gener- 
ally use electromagnetic motors. Some flapping-wing aircraft also use 
electromagnetic motors, but require a linkage mechanism to convert the 
rotary motion of the motor to the flapping motion of the wings. However, 
as the scale is reduced, conventional motors become inefficient, require 
substantial gearing to achieve a desired wing or propeller velocity, and are 
extremely difficult to manufacture. Therefore, for vehicles below a few 
grams, alternative methods of actuation are required. Similarly, ‘macro 
size’ flying robots (tens to hundreds of grams and larger) may be con- 
structed using conventional methods such as additive and subtractive 
machining and ‘nuts-and-bolts’ assembly. However, for ‘micro size’ fly- 
ing robots (less than a few grams), manufacturing needs to be rethought 
and novel methods applied. To give a perspective on the challenges for 
actuation, we need to consider how the physics of scaling affects the 
performance of electromagnetic motors. At macroscopic scales, motors 
used in electric cars can achieve an efficiency of nearly 90% with a power 
density of several kilowatts per kilogram, whereas at millimetre scales, 
existing motors produce a few tens of watts per kilogram of power at less 
than 50% efficiency. This reduction in performance and the energetic 
expense of flight at small scales mean that both power source and power 
distribution are important considerations. Power distribution is not an 
issue for vehicles in which the primary propulsion system also generates 
the control torques (for example, quadcopters). However, for fixed-wing 
aircraft and some flapping-wing vehicles, most of the power budget must 
be allocated to staying aloft, with a proportionately smaller share left 
for actuating control surfaces. For example, nearly 90% of 
the power budget of the RoboBee (Fig. 2c) is dedicated to the primary 
power actuators that generate lift to stay in the air. As already discussed, 
for power actuators, the typical choice is electromagnetic motors. There 
are alternatives, including electroactive materials such as piezoelectric 
actuators or ultrasonic motors; however, these are most appropriate 
for the smaller end of the size spectrum. Control actuators have more 
options at most scales, including voice coils, shape memory alloys and 
electroactive polymers. 

Given the challenges for small-scale propulsion, an unsteady 
approach to force production is potentially a viable option for small 
vehicles. Unlike fixed-wing aircraft and rotorcraft that produce lift and 
thrust in a quasi-steady way (either directly in the case of rotorcraft 
or indirectly by the movement of air over the wings of a fixed-wing 
drone), the aerodynamics of flapping-wing drones is complex and 
involves the generation and manipulation of vortical structures in the 
air. As is obvious from its existence in nature, flapping-wing pro- 
pulsion is an effective form of locomotion, but translating this into 
functional designs is not a trivial exercise given the morphological 
diversity of the flight apparatus of insects, bats and birds. Nonetheless, 
drone researchers have effectively reproduced natural flight in many 
different ways. Examples include robotic insects such as RoboBee 
and the four-winged DelFly, and bird-sized drones such as the Nano 
Hummingbird (Fig. 2c). 

As size is reduced, in addition to the challenges for actuator selection 
and manufacturing, there are also considerations of how to manufacture 
the entire vehicle. At scales for which electromagnetic actuation and 
bearing-based rotary joints are feasible, more conventional manufactur- 
ing methods are used, such as subtractive machining, additive printing 
and moulding of composite materials. At small scales — for example, for 
aircraft of comparable size to insects and small birds — some of these 
techniques fail, typically owing to limited resolution. Alternative meth- 
ods have been developed; for example, those based on folding (an inher- 
ently scalable technique) have been used to create insect-sized robots, 
avoiding the challenges that are inherent to macro-scale nuts-and-bolts 
approaches. 


BOX 1 
Control autonomy 

According to the definition by the International Organization 
for Standardization, robot autonomy is the ability to perform 
intended tasks based on current state and sensing, without 
human intervention. This definition encompasses a wide 
range of situations, which demand different levels of autonomy 
depending on the type of robot and the intended use. For example, 
although autonomy in tethered robots does not concern energy 
management, mobile robots with long-range travel may require the 
capability to decide when to abort the current mission and locate a 
recharging station. In the case of the small drones discussed here, 
we can identify three levels of increasing autonomy (Table 1). 

• Sensory-motor autonomy: translate high-level human commands 
(such as to reach a given altitude, perform circular trajectory, move 
to global positioning system (GPS) coordinates or maintain position) 
into combinations of platform-dependent control signals (such as 
pitch, roll, yaw angles or speed); follow pre-programmed trajectory 
using GPS waypoints. 

• Reactive autonomy (requires sensory-motor autonomy): 
maintain current position or trajectory in the presence of external 
perturbations, such as wind or electro-mechanical failure; avoid 
obstacles; maintain a safe or predefined distance from ground; 
coordinate with moving objects, including other drones; take off 
and land. 

• Cognitive autonomy (requires reactive autonomy): perform 
simultaneous localization and mapping; resolve conflicting 
information; plan (for battery recharge for example); recognize 
objects or persons; learn. 
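
The nesting of the three levels (each one requires the level below it) maps naturally onto an ordered enumeration. The sketch below is one possible encoding; the class and function names are hypothetical and do not come from any standard or library.

```python
from enum import IntEnum


class Autonomy(IntEnum):
    """Levels of control autonomy, ordered so that higher includes lower."""
    SENSORY_MOTOR = 1
    REACTIVE = 2
    COGNITIVE = 3


def required_levels(level):
    """All levels that must be in place for `level`, lowest first."""
    return [lvl for lvl in Autonomy if lvl <= level]


def supports(drone_level, task_level):
    """True if a drone with `drone_level` can take on a task needing `task_level`."""
    return drone_level >= task_level
```

For example, `required_levels(Autonomy.COGNITIVE)` lists all three levels, and a drone with only sensory-motor autonomy does not support a task that needs reactive autonomy.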


Table 1 | Levels of autonomy: requirements, availability and readiness for market 

Level of autonomy       Exteroceptive sensors     Computational load  Supervision required  Readiness level   Validated on drone type 
Sensory-motor autonomy  None or few               Little              Yes                   Deployed          All types 
Reactive autonomy       Few and sparse            Medium              Little                Partly deployed   Fixed wing, rotorcraft and flapping wing 
Cognitive autonomy      Several and high density  High                None                  Not yet deployed  Mostly rotorcraft 


Multi-modal drones 

In many situations, such as search and rescue, parcel delivery in con- 
fined spaces and environmental monitoring, it may be advantageous to 
combine aerial and terrestrial capabilities. Perching mechanisms could 
allow drones to land on walls and power lines in order to monitor the 
environment from a high vantage point while saving energy. Agile drones 
could move on the ground by using legs in conjunction with retractable 
or flapping wings. In an effort to minimize the total cost of transport, 
which will be increased by the additional locomotion mode, these future 
drones may benefit from using the same actuation system for flight con- 
trol and ground locomotion. The different speed and torque requirements 
of these two locomotion modes could be reconciled by adapting the wing 
morphology to the specific situation, similar to the way vampire bats 
(Desmodus rotundus) use their powerful front limbs when flying or walk- 
ing. Alternatively, multi-modal locomotion could be obtained by adding 
large wheels to the sides of hovering drones, by embedding the propul- 
sion system in a rolling cage, or by completely decoupling the spherical 
cage from the inner rotors by means of a gimbal system (Fig. 2b); the lat- 
ter design allows the drone not only to roll on the ground in any direction, 
but, because the rotors are protected by a cage, also to safely collide with 
obstacles or humans without attitude perturbations. Conversely, wings 
could be added to ground-based robots travelling on rough terrain to 
extend their jumping distances, stabilize the landing phase and reduce the 
impact with the ground. In this case as well, the total cost of transport 
could be reduced by sharing the same actuation system between jumping 
and wing-deployment mechanisms. Alternatively, one could use pivot- 
ing wings, which minimize drag at take-off, improve the transition from 
ballistic jump to gliding and maximize the gliding ratio. In the future, 
we may also deploy drones in semi-aquatic environments by using design 
principles inspired by aquatic birds, such as folding wings for plunge div- 
ing or hydrophobic surfaces for dry flight, or by flying squid, which use 
water-jet propulsion for take-off. 


Sensing and control 

Conventional unmanned drones that fly at high altitudes regulate their 
attitude (roll, pitch and yaw) and their position (x, y and z) (Fig. 2d) by 
continuously monitoring and merging data from an inertial measurement 
unit (IMU), which contains three-axis accelerometers and gyroscopes, 
and from a global positioning system (GPS). This technology, which is 
necessary for sensory-motor autonomy, is now also available in small 
drones for recreational or professional use. However, GPS information 
is not sufficiently precise for altitude regulation when flying a few metres 
above the ground and is not always available or reliable in confined areas, 
such as cities, forests and buildings. Even if reliable GPS information were 
available, it would need to be combined with a precise map of the drone’s 
surroundings in order to identify obstacle-free trajectories. However, 
digital maps, such as those used in car navigation systems, are restricted 
to traversable roads, do not include three-dimensional information (the 
height of natural structures, buildings, and bridges, and the presence of 
cables, poles, and so on) and are not refreshed frequently enough to cap- 
ture landscape modification. Therefore, small autonomous drones flying 
at low altitude will need more complex levels of control autonomy and 
additional sensors to detect distances from the surrounding environment 
and perform safe and stable trajectories. Vision is a promising sensor 
modality for small drones because compared with other distance sensors 
such as sonar, infrared and laser range finders used in terrestrial vehicles, 
it does not require energy to interrogate the environment, and for compa- 
rable mass it can gather richer information and span wider fields of view. 
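
The merging of inertial and GPS data described at the start of this section is commonly done with a recursive estimator. The Review does not name a specific algorithm, so the scalar Kalman filter below is an assumed stand-in, reduced to one position axis; the function name and all parameters are hypothetical.

```python
def fuse_position(x_est, p_est, velocity, dt, q, gps_pos, r):
    """One predict/update cycle of a scalar Kalman filter.

    x_est, p_est: previous position estimate and its variance
    velocity, dt: velocity from inertial integration and the time step
    q:            process noise variance added per step
    gps_pos, r:   GPS position reading and its variance
    """
    # Predict: dead-reckon forward using the inertially derived velocity.
    x_pred = x_est + velocity * dt
    p_pred = p_est + q

    # Update: blend in the GPS fix, weighted by relative confidence.
    gain = p_pred / (p_pred + r)
    x_new = x_pred + gain * (gps_pos - x_pred)
    p_new = (1.0 - gain) * p_pred
    return x_new, p_new
```

When the GPS variance `r` is large, as it would be near buildings or under tree cover, the gain falls and the estimate leans on dead reckoning, which drifts. That is one way to see why additional distance sensors are needed at low altitude.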


Reactive autonomy 

Most efforts in small autonomous drones have focused on achieving reac- 
tive autonomy (Box 1) by translating decades' worth of neuroethological 
research on vision-based insect flight into simple control algorithms and 
lightweight sensors, with many also serving as validation of biological 
models. Insect vision relies on compound eyes, which are dense arrays of 
facets pointing in different directions and spanning large fields of view. 
Figure 2 | Drone types with examples. a, Fixed-wing drones. A 10 g robot 
with rudder and flap on the tail is equipped with insect-inspired cameras 
to avoid obstacles and regulate altitude (left); the 690 g eBee (right), 
with elevons on the trailing edge of the wings, is equipped with high- 
resolution cameras for imaging and ventral optic flow sensor for landing. 
b, Rotorcraft. The 380 g AR.Drone 2.0 (left) is a quadcopter equipped with 
a high-resolution camera for imaging and a ventral optic flow sensor for 
maintaining position; the 380 g Gimball robot (right) is composed of a 


Compound eyes have fixed focus, lower resolution and smaller binocular 
overlap than human eyes and therefore do not use stereo vision for dis- 
tance estimation, except for very short ranges. To safely fly in cluttered 
environments, insects instead rely on image motion, also known as optic 
flow, generated by their own displacement relative to the surround- 
ings. It has been experimentally shown that their neural system reacts to 
optic flow patterns to produce a large variety of flight capabilities, such 
as obstacle avoidance, speed maintenance, odometry estimation, 
wall following and corridor centring, altitude regulation, orientation 
control and landing. Optic flow intensity depends on the distance from 
objects (it is inversely proportional to that distance) only during 
translational movements; during rotational movements it is instead 
proportional to the rotational velocity of the agent. Furthermore, 
translational optic flow intensity depends on the 
speed-to-distance ratio, which raises the issue of how insects can estimate 
ground speed and distance from obstacles at the same time. 
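
The two properties of optic flow described above (a translational part that depends on the speed-to-distance ratio, and a rotational part that depends only on rotation rate) can be written down for a simplified planar case. The sketch below assumes a point seen at a given bearing from the direction of travel and a particular sign convention for yaw; the function names are hypothetical, and subtracting a gyroscope reading is shown as one common way to isolate the translational part.

```python
import math


def optic_flow(speed, distance, bearing_rad, yaw_rate=0.0):
    """Apparent angular velocity (rad/s) of a point in a planar model.

    The translational part is (speed / distance) * sin(bearing);
    the rotational part equals the yaw rate (sign convention assumed).
    """
    translational = (speed / distance) * math.sin(bearing_rad)
    return translational + yaw_rate


def distance_from_flow(flow, speed, bearing_rad, gyro_rate=0.0):
    """Recover distance once rotation is removed and speed is known."""
    translational = flow - gyro_rate  # remove the rotation-induced flow
    return speed * math.sin(bearing_rad) / translational


# Ambiguity: twice the speed at twice the distance gives the same flow.
side = math.pi / 2
same_flow = (optic_flow(2.0, 4.0, side), optic_flow(1.0, 2.0, side))
```

Distance can only be recovered when speed is known from another source, which is the ambiguity the paragraph above points to.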

Many vision-based insect capabilities have been replicated with small 
drones. For example, it has been shown that small fixed-wing drones 
and helicopters can regulate their distance from the ground using ventral 
optic flow while a GPS was used to maintain constant speed and an IMU 
was used to regulate roll angle. The addition of lateral optic flow sensors 
also allowed a fixed-wing drone to detect near-ground obstacles. Optic 
flow has also been used to perform both collision-free navigation and 
altitude control of indoor and outdoor fixed-wing drones without a 
GPS. In these drones, the roll angle was regulated by optic flow in the 
horizontal direction and the pitch angle was regulated by optic flow in 
the vertical direction, while the ground speed was measured and main- 
tained by wind-speed sensors. In this case, the rotational optic flow was 
minimized by flying along straight lines interrupted by short turns or 
was estimated with on-board gyroscopes and subtracted from the total 
optic flow, as suggested by biological models. Other authors have even 
proposed bio-inspired control methods that do not require absolute speed 
measurement with dedicated sensors, thus eliminating a potential source 
of error. These methods consist of continuously adjusting flight speed and 
altitude to maintain constant optic flow signals, which has also been sug- 
gested by biological models. For example, altitude control and landing 
coaxial dual propeller core protected by a decoupled and freely rotating 
spherical cage. c, Flapping-wing drones. The 19 g Nano Hummingbird 
(left) has a camera for live video streaming; the 80 mg RoboBee (right) 
can fly tethered to an external power source. d, Coordinate system of a 
generic drone. The drone position is defined in the x, y and z coordinates 
and the attitude in the yaw, roll and pitch angles. The control algorithms 
to maintain position and attitude can vary according to drone type, 
configuration of the actuators and size. 


was achieved by adding negative feedback from ventral optic flow, either 
to the control surfaces that regulate pitch angle or to those that regulate 
thrust. The latter method has also been shown to be effective for altitude 
control and landing on mobile platforms and, when used in conjunction 
with lateral optic flow and lateral thrust, also for replicating flight trajec- 
tories of honeybees during indoor flight. However, so far this method 
has been validated only on tethered drones. An error correction method 
has also been demonstrated on a quadcopter equipped with omnidirec- 
tional vision to fly in corridors by continuously making corrections aimed 
at reducing the difference between measured optic flow and optic flow 
expected from flight at the desired altitude, attitude, speed, heading and 
distance from walls. Quadcopter and flapping-wing drones are intrinsi- 
cally unstable and must compensate for positional drift generated by noise 
in gyroscope and accelerometer signals to hover in place and maintain 
attitude. It has been shown that the direction of optic flow can be used to 
reduce uncertainty in inertial sensor signals. Wide-field optical sensors 
inspired by the simple eyes of insects, called ocelli, have been used for 
attitude stabilization of flapping-wing drones and quadcopters. 
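
The idea of holding ventral optic flow constant through negative feedback can be simulated in a few lines. The sketch below is a deliberately crude model, not the controller used in any of the cited studies: the drone is a point whose climb rate is set directly by the flow error, and the gain, time step and set point are assumed values.

```python
def regulate_altitude(height, speeds, flow_setpoint, gain=2.0, dt=0.05):
    """Simulate negative feedback on ventral optic flow (speed / height).

    Only the measured flow is fed back; height itself is never sensed.
    Returns the height after each simulation step.
    """
    history = []
    for speed in speeds:
        flow = speed / height
        # Flow above the set point means the ground is too close for the
        # current speed, so climb; flow below the set point means descend.
        climb_rate = gain * (flow - flow_setpoint)
        height = max(height + climb_rate * dt, 0.01)  # stay above ground
        history.append(height)
    return history


# Cruise at 2 m/s, then slow to 0.5 m/s with the same flow set point.
trace = regulate_altitude(5.0, [2.0] * 200 + [0.5] * 200, flow_setpoint=1.0)
```

In this model the height settles where speed divided by height equals the set point, so reducing forward speed brings the drone down without any explicit altitude measurement. This is the mechanism that makes the approach attractive for landing.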

The need to detect optic flow over wide fields of view at high temporal 
frequency in a small package has also driven the development of insect- 
inspired vision sensors that are smaller, lighter and faster than single-lens 
conventional cameras. For example, some authors have resorted to 
using multiple optic flow sensors of the kind found in optical computer mice, 
whereas others have developed neuromorphic chips that not only 
extract optic flow, but also adapt to the large variety of light intensities that 
can be experienced when flying in confined spaces. Similarly, specialized 
micro optic flow sensors have been used for altitude regulation in sub-1 g 
flying robots. Miniature, curved, artificial compound eyes with a high 
density of photoreceptors, wide fields of view and computational proper- 
ties similar to insect eyes have been recently described, but have not 
yet been used in drones. 

Figure 3 | Flight time against mass of small (less than 1 kg) drones. 
Examples include each of the drone types shown in Fig. 2 (fixed wing, rotary 
and flapping wing). Regardless of the type, there is a clear trend in how flight 
time scales with mass. Smaller drones have significantly reduced flight times 
(tens of seconds compared with tens of minutes for larger drones). This is due 
to actuation limitations and the physics of flight at small scales, as discussed 
in this Review, and brings about challenges for all levels of autonomy 
described in Box 1. 

Coordinated flight of multiple drones is another instance of reactive 
autonomy, but it raises additional challenges in sensing, communication 
and control. Although very little is known about the sensing and con- 
trol that underlies coordinated flight in insects, it has been shown that 
cohesion in flocks of starlings is the result of individual birds aligning 
their trajectories with a small number of their nearest neighbours defined 
in topological (not metric) space. Evidence from pigeon flocks also 
suggests that in some conditions, such as homing, a hierarchical organi- 
zation emerges whereby the trajectories of more experienced birds are 
tracked and copied by other birds. Computational models of bird flock- 
ing build on the combination of three simple behavioural rules, namely 
repulsion from near individuals, movement towards far neighbours and 
alignment with average velocities of neighbours. Remarkably, these rules 
can also account for hierarchical flocking without explicit signalling of 
experienced agents. However, perceiving the position of other flying 
drones is still very challenging. Infrared-based bidirectional communica- 
tion has been used for detecting the range and bearing of neighbouring 
drones in indoor environments, but the relatively short range of a few 
metres prevents its use in outdoor environments. Consequently, recent 
demonstrations of outdoor flocking of ten fixed-wing drones and ten 
hovering drones relied on radio communication of GPS coordinates 
between neighbouring drones. An alternative approach to perception of 
neighbours consists of exploiting the direction and intensity of the sound 
emitted by neighbouring drone engines, which may complement or 
replace visual cues at night or in foggy conditions. 
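
The three behavioural rules mentioned above (repulsion from near individuals, movement towards far neighbours and alignment with the neighbours' average velocity) can be combined into a single steering command. The sketch below is a minimal two-dimensional version; the distance threshold and the three weights are assumed values, and the function and field names are hypothetical.

```python
import math


def flocking_velocity(me, neighbours, near=1.0, w_rep=1.0, w_att=0.1, w_align=0.5):
    """Steering command from three flocking rules (2D sketch).

    `me` and each neighbour are dicts with "pos" and "vel" as (x, y) tuples.
    """
    steer_x = steer_y = 0.0
    sum_vx = sum_vy = 0.0
    for other in neighbours:
        dx = other["pos"][0] - me["pos"][0]
        dy = other["pos"][1] - me["pos"][1]
        dist = math.hypot(dx, dy)
        if dist > 0.0:
            # Rule 1: move away from neighbours closer than `near`.
            # Rule 2: move towards neighbours that are farther away.
            weight = -w_rep if dist < near else w_att
            steer_x += weight * dx / dist
            steer_y += weight * dy / dist
        sum_vx += other["vel"][0]
        sum_vy += other["vel"][1]
    if neighbours:
        # Rule 3: steer towards the neighbours' average velocity.
        count = len(neighbours)
        steer_x += w_align * (sum_vx / count - me["vel"][0])
        steer_y += w_align * (sum_vy / count - me["vel"][1])
    return steer_x, steer_y
```

In practice the hard part is obtaining the neighbours' positions and velocities in the first place, which is the perception problem the paragraph above describes; this sketch simply assumes they are given.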


Cognitive autonomy 

A different approach is pursued to endow small drones with cognitive 
autonomy (Box 1). This involves leveraging decades of research in statis- 
tical methods for vision-based navigation of terrestrial vehicles, such as 
visual odometry and simultaneous localization and mapping. For 
example, monocular vision has been successfully used for simultaneous 
localization and mapping of quadcopters in indoor and outdoor envi- 
ronments. However, these drones could not avoid obstacles that had not 
been previously detected and mapped. Stereo vision can provide depth 
information and has been used in a 4 g flapping-wing vehicle to reac- 
tively avoid obstacles and in larger quadcopters to perform simultaneous 
localization and mapping. Furthermore, both simultaneous localization 
and mapping and altitude control have been demonstrated in quadcop- 
ters with two sets of stereo cameras, one set pointing forwards and one 
set pointing downwards. Simultaneous localization and mapping algo- 
rithms have also been combined with optic flow methods to estimate 
distances from the surrounding environment and stabilize the drone. 
The statistical approach to cognitive autonomy builds on high-resolu- 
tion digital cameras that can provide dense clouds of data points and on 
computationally expensive algorithms for reducing uncertainties. Conse- 
quently, cognitive autonomy requires heavier sensory payloads and more 
powerful computational units, which may explain why, so far, progress has 
been slower for small drones. Although various forms of reactive auton- 
omy have been demonstrated in several types of drones with a wide range 
of mass and flight endurance (Fig. 3), simultaneous localization, mapping, 
and path planning have so far been demonstrated mostly in hovering 
platforms with a relatively large total mass and restricted flight endurance. 


Regulatory issues 

Unleashing the socio-economic potential of small drones will require 
not only translating the scientific and technological advances described 
above into reliable products, but also creating a regulatory framework 
that will enable public and private entities to operate in full respect of 
safety, privacy and security concerns. The rapid development of com- 
mercial drones and potential benefits prompted the US Federal Aviation 
Administration (FAA) to define, in 2013, a road map for the gradual 
integration in the national airspace system of civil unmanned drones 
that can fly beyond the operator's line of sight by 2028 (ref. 91). In a
first “accommodation” stage, which has already started, the FAA allows 
operation of unmanned aerial vehicles for specific services by issuing 
airworthiness certificates on request and by defining a set of standards 
and procedures. In the second “integration” stage, the FAA aims to imple- 
ment the standards and procedures for operation of civil unmanned 
aircraft while retaining on-demand certification for cases that do not
fit in the civil unmanned category, and to establish six test sites with 
diverse geographical and climate conditions to study safety and privacy 
issues. In the final “evolution” stage, the FAA aims to continuously refine 
and update regulations, policies and standards on the basis of evolving 
technological developments and experiences. The FAA has identified 
sense-and-avoid and communication-and-control as research priorities 
for full integration of unmanned drones, but for the foreseeable future 
it will continue to require continuous supervision by a human operator 
with certified training. In the European Union the legislation is more 
fragmented because aircraft below 150 kg are separately regulated by 
individual member states. Some countries, including the Czech Republic, 
France, Ireland, Italy, Sweden, Switzerland and the United Kingdom, 
have national regulations for unmanned aircraft, although these
regulations differ between countries. For example, the United Kingdom
allows operation of drones of less than 20 kg within the line of sight 
over congested areas on request to the UK Civil Aviation Authority”; 
France allows remotely operated aircraft to operate beyond the line of 
sight and near cities under specific conditions”’; and Switzerland allows 
autonomous operation of drones within the line of sight as long as the 
operator can regain control” if, for example, the drone loses altitude or 
is about to collide with something. In 2013, the European Commission 
defined a road map” for the integration of remotely operated aircraft in 
the European aviation system using a staged approach, the aims, timeline 
and research priorities of which are similar to those defined in the US 
road map. However, the first stage of the European road map consists of 
extending European regulations for large unmanned aircraft systems to 
those below 150 kg by adapting to national regulations, which may result
in more liberal regulations than those in the United States. Furthermore, 
the European Commission explicitly aims to improve the public perception
of drones and to promote widespread use of these technologies for
public and commercial use.


The future of small drones 

Scaling up the use of small drones from a niche market to widespread use 
in civilian applications depends on two related prerequisites: the capabil- 
ity to autonomously and safely manoeuvre in confined spaces and the 
removal of the legal requirement of supervised operation within the line 
of sight. We do not foresee major scientific or technological roadblocks 
to achieving higher levels of autonomous control in research and com- 
mercial drones within the next five years. However, the legal requirement 
of a certified human operator within the line of sight of every single drone
is a roadblock that will almost certainly stay for the next five years in the 
United States and Europe, and removing it will depend on the reliability 
and safety of small drones. 

Assuming that the legal roadblock will gradually be lifted, we expect 
that reactive forms of control autonomy (Box 1) will become widely 
available within the next 5-10 years for small commercial drones for 
long-range operation. We also anticipate that bio-inspired approaches 
will dominate because they require relatively simple computation and 
sensors. For example, some commercial drones (such as the eBee and 
the AR.Drone 2.0 in Fig. 2) already use ventral optic flow for outdoor 
landing and for indoor position stabilization. In this context, an interest- 
ing scientific challenge will be to understand how different navigation 
capabilities can be integrated into a coherent control system akin to the 
nervous system of a flying insect. In parallel, an important engineering 
challenge will be to define test conditions and performance standards in 
cooperation with governmental institutions and industrial associations 
to assess the capabilities and reliability of drone technologies. 

We also expect rapid progress in cognitive autonomy, which will con- 
tinue to be driven by the development of artificial intelligence for smart- 
phones capable of identifying human users, learning their behaviours 
and creating representations of their environment
(http://www.google.com/atap). On the one hand, face recognition and gesture-based
interaction without wearable devices will become widely available for hobby and
toy drones within the next five years; for example, by equipping small 
drones with human-motion-sensing devices developed by the gaming 
industry. On the other hand, mapping and path planning for autono- 
mous flight in partially unknown and changing environments will rep- 
resent a challenge for small drones for at least the next ten years and will 
continue to be illegal in the United States and Europe until at least 2028. 

Despite these legal roadblocks, we expect an increasing demand for 
small drones in civilian use because of their intrinsic safety. Kinetic 
energy — one measure of the potential of a drone to cause physical 
harm to a human” — is linearly proportional to the drone’s mass and 
is quadratic in velocity. Thus smaller drones are likely to cause pro- 
portionally less harm as the size is reduced, but more subtly, small 
drones typically operate at lower speeds, dramatically decreasing the 
potential for harm. For example, a 500 g drone flying at 5 m s⁻¹ has
6.25 J of kinetic energy, which is equivalent to the potential energy of a
large apple dropped from about 2 m. As the mobile computing indus- 
try continues along the path of miniaturization, drone developers will 
continue to reap the benefits in the form of smaller, lower power sensor 
packages (for example, the IMUs used in cell phones and video game 
controllers) and the slow, but steady increase in battery energy density. 
Another hardware advance that will inevitably affect future drones is 
the movement away from general-purpose computation in favour of 
more specialized high-performance and low-power hardware accel- 
erators tuned for the various functions needed by autonomous drones 
(both reactive and cognitive autonomy). These accelerators are already 
present in various mobile devices and are suggested to be a solution to
computation for the control of insect-scale drones.
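
The kinetic-energy comparison in this paragraph can be checked with a short calculation. The sketch below assumes standard gravity of 9.81 m s⁻² and ignores air resistance; the function names are illustrative only.

```python
G = 9.81  # standard gravitational acceleration in m/s^2 (assumed)

def kinetic_energy(mass_kg, speed_m_s):
    """Kinetic energy in joules: linear in mass, quadratic in speed."""
    return 0.5 * mass_kg * speed_m_s ** 2

def equivalent_drop_mass(energy_j, height_m, g=G):
    """Mass in kg whose potential energy at height_m equals energy_j."""
    return energy_j / (g * height_m)

# The example from the text: a 500 g drone flying at 5 m/s
energy = kinetic_energy(0.5, 5.0)
# Mass of an object dropped from 2 m that carries the same energy
apple = equivalent_drop_mass(energy, 2.0)
print(f"{energy:.2f} J, equivalent to about {apple * 1000:.0f} g dropped from 2 m")
```

The result is 6.25 J, matching an object of roughly 320 g dropped from 2 m, which is consistent with the "large apple" comparison. Because energy is quadratic in speed, halving the speed of the same drone cuts its kinetic energy to a quarter.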
Design, fabrication and control of soft robots


Daniela Rus¹ & Michael T. Tolley²


Conventionally, engineers have employed rigid materials to fabricate precise, predictable robotic systems, which are easily 
modelled as rigid members connected at discrete joints. Natural systems, however, often match or exceed the perfor- 
mance of robotic systems with deformable bodies. Cephalopods, for example, achieve amazing feats of manipulation and 
locomotion without a skeleton; even vertebrates such as humans achieve dynamic gaits by storing elastic energy in their 
compliant bones and soft tissues. Inspired by nature, engineers have begun to explore the design and control of soft-bodied 
robots composed of compliant materials. This Review discusses recent developments in the emerging field of soft robotics. 


Biology has long been a source of inspiration for engineers making
ever-more capable machines. Softness and body compliance are salient
features often exploited by biological systems, which tend to seek
simplicity and show reduced complexity in their interactions with their
environment. Several of the lessons learned from studying biological
systems are now culminating in the definition of a new class of machine
that we, and others, refer to as soft robots. Conventional, rigid-bodied
robots are used extensively in manufacturing and can be specifically
programmed to perform a single task efficiently, but often with limited
adaptability. Because they are built of rigid links and joints, they are
unsafe for interaction with humans. A common practice is to separate
human and robotic work spaces in factories to mitigate safety concerns.
The lack of compliance in conventional actuation mechanisms is part of
this problem. Soft robots provide an opportunity to bridge the gap
between machines and people. In contrast to hard-bodied robots, soft
robots have bodies made out of intrinsically soft and/or extensible
materials (for example, silicone rubbers) that can deform and absorb much
of the energy arising from a collision. These robots have a continuously
deformable structure with muscle-like actuation that emulates biological
systems and results in a relatively large number of degrees of freedom
compared with their hard-bodied counterparts. They (Fig. 1) have the
potential to exhibit unprecedented adaptation, sensitivity and agility.
Soft robots promise to be able to bend and twist with high curvatures and
thus can be used in confined spaces; to deform their bodies in a
continuous way and thus achieve motions that emulate biology; to adapt
their shape to the environment, employing compliant motion and thus
manipulate objects, or move on rough terrain and exhibit resilience; or
to execute rapid, agile manoeuvres, such as the escape manoeuvre in fish.

The key challenge for creating soft machines that achieve their full
potential is the development of controllable soft bodies using materials
that integrate sensors, actuators and computation, and that together
enable the body to deliver the desired behaviour. Conventional approaches
to robot control assume rigidity in the linkage structure of the robot and
are a poor fit for controlling soft bodies, so soft materials require new
algorithms.


What is soft? 

‘Soft’ refers to the body of the robot. Soft materials are the key ena- 
blers for creating soft robot bodies. Although Young’s modulus is 
only defined for homogeneous, prismatic bars that are subject to 


axial loading and small deformations, it is nonetheless a useful meas- 
ure of the rigidity of materials used in the fabrication of robotic 
systems’. Materials conventionally used in robotics (for example, 
metals or hard plastics) have moduli in the order of 10⁹-10¹² pascals,
whereas natural organisms are often composed of materials
(for example, skin or muscle tissue) with moduli in the order of
10⁴-10⁹ Pa (orders of magnitude lower than their engineered
counterparts; Fig. 2). We define soft robots as systems that are capable of
autonomous behaviour, and that are primarily composed of materi- 
als with moduli in the range of that of soft biological materials. 

The advantages of using materials with compliance similar to 
that of soft biological materials include a considerable reduction 
in the harm that could be inadvertently caused by robotic systems 
(as has been demonstrated for rigid robots with compliant joints’”), 
increasing their potential for interaction with humans. Compliant 
materials also adapt more readily to various objects, simplifying 
tasks such as grasping”, and can also lead to improved mobility 
over soft substrates”. 

For the body of a soft robot to achieve its potential, facilities for 
sensing, actuation, computation, power storage and communication 
must be embedded in the soft material, resulting in smart materials. 
In addition, algorithms that drive the body to deliver the desired 
behaviours are required. These algorithms implement impedance 
matching to the structure of the body. This tight coupling between 
body and brain allows us to think about soft-bodied systems as 
machines with mechanical intelligence, in which the body can be 
viewed as augmenting the brain with morphological computa- 
tion’*”’. This ability of the body to perform computation simplifies 
the control algorithms in many situations, blurring the line between 
the body and the brain. However, soft robots (like soft organisms) 
require control algorithms, which (at least for the foreseeable future) 
will run on some sort of computing hardware. Although both body 
and brain must be considered in concert, the challenges involved 
are distinct enough that, in this Review, we find it useful to organize 
them into separate sections. 

In the following three sections we review recent developments in 
the field of soft robotics as they pertain to design and fabrication, 
computation and control, and systems and applications. We then 
discuss persistent challenges facing soft robotics, and suggest areas 
in which we see the greatest potential for societal impact. 


¹Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, The Stata Center, Building 32, 32 Vassar Street, Cambridge, Massachusetts 02139, USA.
²Department of Mechanical and Aerospace Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0403.


Design and fabrication

A robot is classified as hard or soft on the basis of the compliance 
of its underlying materials’. Soft robots are capable of continuum 
deformations, but not all continuum robots are soft. For example, the 
robotic elephant trunk manipulator” is a discrete hyper-redundant 
continuum robot composed of rigid materials; the articulated cath- 
eter robot!’ is an example of a hard continuum robot; OctArm” 
(Fig. 3b) is an example of a semi-soft continuum robot; while the 
caterpillar robot” (Fig. 1a) and the rolling belt robot”' are examples 
of soft continuum robots. These soft machines have modular bodies 
consisting of soft rubber segments, which can be composed seri- 
ally or in parallel to create complex morphologies. The body of a 
soft robot may consist of multiple materials with different stiffness 
properties". A soft robot encases in a soft body all the subsystems 
of a conventional robot: an actuation system, a perception system, 
driving electronics and a computation system, with corresponding 
power sources. Technological advances in soft materials and subsys- 
tems compatible with the soft body enable the autonomous function 
of the soft robot. The rest of this section describes recent progress 
in developing subsystems for soft robots. With this range of compo- 
nents, design tools are used to create the topology of the robot body 
along with the placement of its functional components. Given the 
design road map, the robot is ready to be fabricated. 


Actuation 

The segments of a soft robot are usually actuated in one of two ways 
(Fig. 4): variable length tendons (in the form of tension cables” or 
shape-memory alloy actuators”) may be embedded in soft segments, 
to achieve, for example, robotic octopus arms (Fig. 3f); or pneumatic 
actuation is used to inflate channels in a soft material and cause 
a desired deformation. Pneumatic artificial muscles (PAMs), also 
known as McKibben actuators, are examples of compliant linear soft 
actuators composed of elastomer tubes in fibre sleeves”””®. Fluidic 
elastomer actuators (FEAs) are a new type of highly extensible and 
adaptable, low-power soft actuator. FEAs comprise synthetic elas- 
tomer films operated by the expansion of embedded channels under 
pressure. Once pressurized, the actuator will keep its position with
little or no additional energy consumption. FEAs can be operated
pneumatically or hydraulically. Given a small number of
options for pressurized working fluid generation, and a significant 
difference between the time constants for the generators and actua- 
tors, pressure-regulating components such as regulators and valves 
are necessary. Regardless of the actuation method, soft actuators are 
frequently arranged in a biologically inspired agonist-antagonist 
arrangement (like muscles) to allow bidirectional actuation. Another 
benefit of this arrangement is that co-contraction of muscle pairs 
leads to adaptable compliance. 
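
The link between co-contraction and adaptable compliance can be made concrete with a toy model. The sketch below is an assumption for illustration and does not come from the cited work: each actuator is treated as a quadratic spring, so an agonist with pretension `u1` pulls with force c(u1 - q)² and an antagonist with pretension `u2` pulls back with c(u2 + q)², where `q` is the joint displacement and `c` a hypothetical spring constant. The model is only meaningful while both actuators stay in tension (u1 - q ≥ 0 and u2 + q ≥ 0).

```python
def net_force(q, u1, u2, c=1.0):
    """Net force on the joint from two opposing quadratic-spring actuators."""
    f_agonist = c * (u1 - q) ** 2     # pulls towards +q, slackens as q grows
    f_antagonist = c * (u2 + q) ** 2  # pulls towards -q, tightens as q grows
    return f_agonist - f_antagonist

def equilibrium(u1, u2):
    """Displacement at which the two forces balance."""
    return (u1 - u2) / 2.0

def stiffness(u1, u2, c=1.0):
    """Restoring stiffness, the negative derivative of net_force with respect to q."""
    return 2.0 * c * (u1 + u2)

# Co-contraction: raise both pretensions by the same amount
relaxed = (1.0, 0.6)
tensed = (2.0, 1.6)
print(equilibrium(*relaxed), stiffness(*relaxed))
print(equilibrium(*tensed), stiffness(*tensed))
```

In this model the net force simplifies to c(u1 + u2)(u1 - u2 - 2q), so the difference of the two pretensions sets the resting position while their sum sets the stiffness. Raising both together leaves the joint where it is but makes it stiffer, which is the behaviour described for muscle pairs.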

The design and actuation of soft-robotic systems dates back to at 
least 1992, when a team demonstrated the impressive capabilities 
of soft, flexible microactuators” (Fig. 3a). This work demonstrated 
the approach of pneumatic actuation of robotic elements composed 
of elastomer. In this approach, a fluid (usually air) is used to inflate 
channels in the elastomer, while some asymmetry in the design or 
constituent materials causes the component to actuate (move) in 
the desired way (Fig. 4). The resulting continuous, adaptive motions 
can seem surprisingly lifelike. Other groups (Fig. 1b, c) used
soft lithography techniques adapted from microfluidics, as well as 
soft composite materials composed of various silicone polymers 
and elastomers at times embedded with paper or cloth, to design 
and fabricate pneumatically actuated soft systems. One challenge 
with these designs based on pneumatic networks (also referred to 
as pneu-nets) is that the high strains required for actuation can lead 
to slow actuation rates and rupture failures. A slightly more complex 
design for pneumatically actuated elastomeric soft robots reduced 
the material strain required for actuation, and allowed the untethered
walking of a large, soft robot. Soft-lithography fabrication
approaches typically use a layer of stiffer rubber or elastomer, some- 
times with paper, fabric or a plastic film embedded, to achieve asym- 
metric strain for actuation. An alternative approach is to augment all 
elastomeric elements with flexible fibres, which limit the stress taken 
up by the elastomer during pneumatic actuation. The result is soft 
actuators with reduced extensibility and flexibility, but the ability to 
withstand higher actuation pressures, and hence apply larger forces. 
Using complex moulding and/or free-form fabrication techniques,
it is possible to embed fibres directly into pneumatic elastomeric
actuators to achieve agile motions based on bending.


Figure 1 | Mobile soft-robotic systems inspired by a range of biological systems. a, Caterpillar-inspired locomotion. b, A multi-gait quadruped. c, Active camouflage. d, Walking in hazardous environments. e, Worm-inspired locomotion. f, Particle-jamming-based actuation. g, Rolling powered by a pneumatic battery. h, A hybrid hard-soft robot. i, Snake-inspired locomotion. j, Jumping powered by internal combustion. k, Manta-ray inspired locomotion. l, An autonomous fish.


Figure 2 | Approximate tensile modulus (Young’s modulus) of selected engineering and biological materials. Soft robots are composed primarily of materials with moduli comparable with those of soft biological materials (muscles, skin, cartilage, and so on), or of less than around 1 gigapascal. These materials exhibit considerable compliance under normal loading conditions.

Although most soft-robot prototypes have used pneumatic or 
hydraulic actuation, a great deal of research has focused on the 
development of electrically activated soft actuators composed of 
electroactive polymers (EAPs), which have also led to prototype
systems. Since energy is typically most readily stored in electrical
form, and computation is usually done on electronic circuits, it may 
be more efficient to directly use electrical potential to actuate soft 
robots. Types of EAPs include dielectric EAPs, ferroelectric poly- 
mers, electrostrictive graft polymers, liquid crystal polymers, ionic 
EAPs, electrorheological fluids, ionic polymer-metal composites, 
and stimuli-responsive gels. Since a detailed discussion is beyond the 
scope of this Review, we refer the reader to refs 38, 39 for details. In 
general, fabrication, performance, and long-term stability are active 
areas of research in EAPs. 

Instead of designing the stiffness of robotic systems by tuning 
their constituent materials, another line of soft-robotics research 
has sought to control material stiffness on the fly. One approach is 
to embed or encase soft materials with stiffer materials such as wax
or metal, which can be thermally softened. Embedded heaters can
thus be used to adjust the structure’s effective stiffness and allow for
compliant behaviour or actuated repositioning. Similarly, isothermal
phase change caused by particle jamming has also been explored as a
method of adjusting a soft robot’s rigidity for actuation (Fig. 1f),
or even for grasping an impressive array of objects (Fig. 3c).


Stretchable electronics 

So far, most integrated soft-robotic systems have relied on conven- 
tional, rigid electronics to store the control algorithms and connect 
to the systems’ actuators, sensors and power sources. However, there 
has recently been much research in the area of soft and stretchable 
electronics*””. A full discussion of this area is beyond the scope 
of this Review, but, as this field of electronics matures, we expect 
greater integration with soft robots, resulting in completely soft 
prototypes. 


Sensing 

The compliance and morphology of soft robots precludes the use 
of many conventional sensors including encoders, metal or semi- 
conductor strain gauges, or inertial measurement units (IMUs). 
Although flexible-bending sensors based on piezoelectric polymers 
are available as commercial products, these may not be appropriate 
owing to the need for all elements of the system to be both bendable 
and stretchable. Soft, stretchable electronics may enable new sensing 
modalities’. The basis of proprioceptive sensors for a soft robot is 
usually either non-contact sensors or very low modulus elastomers 
combined with liquid-phase materials. Because soft robots are actu- 
ated by generating curvatures, proprioception relies on curvature 
sensors. The low modulus of proposed elastomer sensors (which
have characteristic moduli in the range of 10⁵-10⁶ Pa) imparts minimal
change on the impedance of the underlying structures. These
sensors generally have layered structures, in which multiple thin 
elastomer layers are patterned with microfluidic channels by soft 
lithography. The channels are subsequently filled with a liquid 
conductor (for example, gallium-containing alloys such as eutectic 
gallium-indium, or EGaIn). With layered channel geometries, it is 
possible to tailor sensors for measuring various strains including 
tensile, shear” or curvature’'. To address the fabrication challenges 
involved in injecting complex channel networks with liquid conduc- 
tor, recent work has investigated mask deposition of the conduc- 
tor”, or direct 3D printing of conductive material”. Alternatively, 
exteroceptive sensing may be used to measure the curvature of a 
soft robot’s body segments in real time’. To expand the applications 
of soft robotics, compatible chemical and biological sensors™* may 
be used to sense environmental signals. Such sensors may be more 
compatible with soft robots than the optical and audio recorders 
typically used in robotics. 


Power sources 

A big challenge for soft robots is stretchable, portable power 
sources for actuation. For pneumatic actuators, existing fluidic 
power sources are not soft and are usually big and bulky. Current 
off-the-shelf pressure sources are generally limited to compressors 
or pumps, and compressed air cylinders”. If we draw an analogy to 
electrical systems, compressors are similar to generators as they con- 
vert electrical energy into mechanical energy, and compressed gas 
cylinders are similar to capacitors as they store a pressurized fluid 
in a certain volume to be discharged when required. Miniature compressors
use valuable electrical energy inefficiently, and cylinders of
useful form factors do not offer longevity. What has been missing 
for fluidic systems is the equivalent of a battery, whereby a chemical 
reaction generates the necessary energy for actuation using a fuel. 
The chemically operated portable pressure source, or pneumatic 
battery (Fig. 1g), generates pressurized gas using a hydrogen peroxide
monopropellant. Combustible fuels are another promising
high-energy-density chemical fuel source.

Electrically powered actuators (as well as the electrical controllers 
for pneumatic systems) require soft, flexible, lightweight electrical 
power sources”. As with soft electronics, this is an active area of 
research. Recent promising developments include batteries based on 
graphene, organic polymers” and embedded conductive fabric”. 


Design 

Existing soft robotic systems have typically been designed with con- 
ventional 3D computer-aided design (CAD) software. However, cur- 
rent CAD software was not created with free-form 3D fabrication 
processes in mind, and does not easily accommodate the complex 
non-homogeneous 3D designs that may be desired for soft robot- 
ics. This has led researchers to either rely on relatively simple ‘2.5D’ 
Figure 3 | Grasping and manipulation, which are canonical challenges
in robotics, can be greatly simplified with soft robotics. Examples 

of experimental soft-robotic manipulation systems demonstrating 
microactuation” (a), soft-continuum manipulation” (b), grasping with 


layered designs, or come up with customized approaches to the 
design and fabrication of each system, typically based on commer- 
cial 3D moulding techniques’’”*. Following an alternative approach, 
researchers have used design automation algorithms inspired by
evolution to design soft robots. Soft-robot designs have been
automatically generated using custom finite element analysis software
(VoxCAD), which accommodates materials with a large range of
moduli, coupled with design optimization using an evolutionary
algorithm. In addition, evolutionary algorithms have been used
to automatically generate soft-robot designs.
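
The general shape of such an evolutionary design loop can be sketched as follows. This is a deliberately simplified illustration, not the method of the cited work and not an interface to VoxCAD: a design is assumed to be a row of voxels that are each soft (0) or stiff (1), and `toy_fitness` is a hypothetical stand-in for the simulated performance that a physics engine would normally provide.

```python
import random

def evolve(fitness, n_voxels=16, pop_size=20, generations=50,
           mut_rate=0.1, seed=0):
    """Evolve a row of soft (0) / stiff (1) voxels towards higher fitness."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_voxels)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: keep the better half unchanged (elitism)
        population.sort(key=fitness, reverse=True)
        parents = population[:pop_size // 2]
        # Variation: each parent yields one child with random voxel flips
        children = [[1 - gene if rng.random() < mut_rate else gene
                     for gene in parent]
                    for parent in parents]
        population = parents + children
    return max(population, key=fitness)

def toy_fitness(design):
    """Hypothetical score: count soft/stiff transitions between neighbours."""
    return sum(1 for a, b in zip(design, design[1:]) if a != b)

best = evolve(toy_fitness)
print(best, toy_fitness(best))
```

In a real workflow the fitness call is the expensive step, because each candidate must be simulated (for example, to measure distance travelled). Keeping the parents in the population guarantees that the best score never decreases from one generation to the next.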


Fabrication 

Recent progress in the field of soft robotics has been enabled by the 
development of rapid digital design and fabrication tools. Research- 
ers have manufactured complex soft-robotic systems by taking 
advantage of rapid and adaptable fabrication techniques”, includ- 
ing multimaterial 3D printing”, shape deposition manufacturing 
(SDM)* and soft lithography”. These techniques can be combined 
to create composites with heterogenous materials (for example, 
rubber with different stiffness moduli), embedded electronics and 
internal channels for actuation*’””. Direct digital printing with soft 
materials may be another option for fabricating arbitrary soft struc- 
tures. A variety of robot bodies can be fabricated using these tech- 
niques. The next section discusses control systems supporting a wide 
range of soft-robot applications. 


Computation and control 

Unlike the control of rigid bodies, the movements of which can 
be described by six degrees of freedom (three rotations and three 
translations about the x, y and z axes), the movements of soft bodies 
cannot be confined to planar motions. Soft materials are elastic and 
can bend, twist, stretch, compress, buckle, wrinkle and so on. Such 
motion can be viewed as offering an infinite number of degrees of 
freedom, which makes the control of soft robots very challenging. 
Controlling soft robots requires new approaches to modelling, con- 
trol, dynamics, and high-level planning. 
particle jamming (c), simple gripper fabrication by soft lithography (d),
underactuated dextrous grasping (e), octopus-inspired manipulation (f),
inflatable robotic manipulators (g), feedback control of a multisegmented
arm (h) and a soft glove for rehabilitation (i).


The muscle analogy for soft actuators has driven a number of 
biologically inspired approaches to modelling and control for soft 
materials. The octopus arm is the prototypical example of a highly 
adaptive, soft actuator in nature that has been the source of inspira- 
tion for several biomimetic designs. Octopuses form pseudo-joints 
and use human-like strategies for precise point-to-point movements 
such as fetching’’. An understanding of the working principles and 
control of soft organisms (such as the octopus) has led to a model for 
the control of soft robots”*. Likewise, the caterpillar has provided an 
ideal model for mobile soft robots’’. The study of these systems for 
the development and implementation of soft-robotic systems has, 
in turn, reflected back on our understanding of the mechanics and 
control of the associated natural systems”. 


Modelling and kinematics 

The kinematics and dynamics of soft-robotic systems are unlike 
those of conventional, rigid-bodied robots. When composed of a 
series of actuation elements, these robots approach a continuum 
behaviour. In theory, the final shape of the robot can be described 
by a continuous function, and modelling this behaviour requires 
continuous mathematics. Because soft robots are different from con- 
ventional rigid linkage-based systems, researchers have developed 
new static, dynamic and kinematic models that capture their ability 
to bend and flex”. 

Robots made entirely from soft elastomer and powered by fluids 
do not yet have well-understood models or planning and control 
algorithms, primarily because their intrinsic deformation is con- 
tinuous, complex and highly compliant. In addition, soft robots 
are often under-actuated; they can contain many passive degrees 
of freedom, and when driven with low-pressure fluids the available 
input fluid power is unable to compensate for gravitational loading. 
Designers often model the kinematics of soft robots using a sim- 
plifying assumption that leads to the piecewise constant curvature 
(PCC) model. The PCC model is equivalent to many other modelling 
approaches. Building on the PCC model, researchers have 
developed methods to map the actuation space to the configuration 
space. These approaches are robot-specific in that they integrate 
the morphology of the robot and the characteristics of the actuation 
system. One approach uses Bernoulli-Euler beam mechanics to predict 
deformation; another develops a relationship between the 
joint variables and the curvature arc parameters, which applies 
to high- and medium-pressure robots; and a third presents models 
that describe the deformation of robots actuated with low pressures. 
These models are the basis for the forward kinematics of 
their respective systems. However, the PCC model does not capture 
all aspects of soft robots and, in an effort to increase the envelope of 
the model, non-constant curvature models have been developed. 
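As an illustration of the PCC idea, the following minimal sketch computes the forward kinematics of a planar arm in which every segment bends as a circular arc. The function name, the planar restriction and the choice of curvature and arc length as configuration variables are assumptions made for this example; it is not the model of any specific robot discussed here.

```python
import math

def pcc_forward_kinematics(curvatures, lengths):
    """Tip pose (x, y, theta) of a planar arm made of constant-curvature arcs.

    curvatures[i] is the curvature (1/m) of segment i and lengths[i] its arc
    length (m). The base sits at the origin and points along +x.
    """
    x, y, theta = 0.0, 0.0, 0.0
    for kappa, length in zip(curvatures, lengths):
        if abs(kappa) < 1e-9:
            # Straight segment: the limit of an arc as curvature goes to zero.
            x += length * math.cos(theta)
            y += length * math.sin(theta)
        else:
            # Circular arc of radius 1/kappa swept through angle kappa*length.
            end = theta + kappa * length
            x += (math.sin(end) - math.sin(theta)) / kappa
            y += (math.cos(theta) - math.cos(end)) / kappa
            theta = end
    return x, y, theta

# A single segment of unit length bent into a quarter circle.
tip = pcc_forward_kinematics([math.pi / 2], [1.0])
```

Mapping actuator inputs (pressures or cable lengths) to the curvatures is the robot-specific step and is deliberately left out of this sketch.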

The inverse-kinematics problem, as posed in ref. 75 (computing 
the curvatures required to place a specified point on the robot body 
at a desired location), is more challenging. Existing literature has 
provided several approaches for semi-soft robots. A limitation 
of existing approaches to solving the inverse-kinematics problem for 
linear soft bodies (for example, arms) is that neither the 
whole body nor the pose of the end effector (which may be important 
for manipulation, sensing, and so on) is considered in the 
solution. Autonomous obstacle avoidance and movement through a 
confined environment are difficult without a computational solution 
to the inverse-kinematics problem that is aware of the whole body of 
the robot in space. Real-time, closed-loop curvature controllers are 
required that drive the bending of the soft pneumatic body segments 
of the manipulator despite their high compliance and lack of kinematic 
constraints. One method for closed-loop curvature control 
of a soft-bodied planar manipulator uses the PCC assumption with 
a cascaded curvature controller. An alternative visual servo control 
approach for cable-driven soft-robotic manipulators has also been 
proposed. 
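The inverse-kinematics problem as posed above (finding curvatures that place a point of the body at a desired location) can be sketched numerically. The example below is a generic damped least-squares iteration on a planar PCC arm, not the method of any cited work; the function names, the finite-difference Jacobian and the damping value are assumptions chosen for illustration. It places only the tip and ignores the rest of the body, which is exactly the limitation noted in the text.

```python
import math

def tip_position(curvatures, lengths):
    """Planar piecewise-constant-curvature forward kinematics (tip x, y)."""
    x, y, theta = 0.0, 0.0, 0.0
    for kappa, length in zip(curvatures, lengths):
        end = theta + kappa * length
        if abs(kappa) < 1e-9:
            x += length * math.cos(theta)
            y += length * math.sin(theta)
        else:
            x += (math.sin(end) - math.sin(theta)) / kappa
            y += (math.cos(theta) - math.cos(end)) / kappa
        theta = end
    return x, y

def solve_curvatures(target, lengths, guess, damping=0.1, iters=200, eps=1e-6):
    """Damped least-squares search for curvatures that put the tip on target."""
    q = list(guess)
    for _ in range(iters):
        x, y = tip_position(q, lengths)
        ex, ey = target[0] - x, target[1] - y
        # Finite-difference Jacobian: one (dx, dy) column per curvature.
        cols = []
        for i in range(len(q)):
            bumped = q[:i] + [q[i] + eps] + q[i + 1:]
            bx, by = tip_position(bumped, lengths)
            cols.append(((bx - x) / eps, (by - y) / eps))
        # Solve (J J^T + damping^2 I) w = e; the system is only 2 x 2.
        a = sum(c[0] * c[0] for c in cols) + damping ** 2
        b = sum(c[0] * c[1] for c in cols)
        d = sum(c[1] * c[1] for c in cols) + damping ** 2
        det = a * d - b * b
        wx = (d * ex - b * ey) / det
        wy = (a * ey - b * ex) / det
        # Update step dq = J^T w.
        q = [qi + c[0] * wx + c[1] * wy for qi, c in zip(q, cols)]
    return q

lengths = [1.0, 1.0, 1.0]
target = tip_position([0.8, -0.4, 0.6], lengths)  # a point known to be reachable
solution = solve_curvatures(target, lengths, guess=[0.3, 0.3, 0.3])
```

Because the arm has more curvatures than task coordinates, many solutions exist; the damped step simply returns one of them near the initial guess.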

The inverse-kinematics algorithm enables task-space planning 
algorithms to autonomously position the end effector of a robot (or some 
other part of its body) in free space; manoeuvre in confined 
environments; and grasp and place objects. These task-space algorithms 
require the planner to consider the entire robot body (the whole 
arm, for example, not just the end effector for manipulators). 
Existing algorithms for whole-body control assume rigid-body systems 
and have not been extended to soft-bodied robots. Controlling the 
configuration of a whole soft body is difficult owing to its compliance. 
One computational approach to whole-body planning for soft planar 
manipulators considers both the primary task of advancing the pose of the end 
effector of the soft robot, and the secondary task of positioning the 
changing envelope of the whole robot in relation to the environment. 


Control 

Researchers have used these models to develop new approaches to 
low-level control, inverse kinematics, dynamic operations and plan- 
ning for soft-robotic systems. An important aspect of these algo- 
rithms is the use of compliance when the robot interacts with its 
environment. Compliance allows the robot to adapt its shape and 
function to objects of unknown or uncertain geometry and is the 
basis for new control and planning algorithms for soft robots in 
the presence of uncertainty. For example, soft robots can execute 
pick and place operations without precise positioning or accurate 
geometric models of the object to be grasped. 

Low-level control for soft robots is by pressure control using pres- 
sure transducers or volume control using strain sensors. Pressure 
control accommodates differences in actuator compliance. Volume 
control is an avenue to configuration control and supports setting 
a maximum safe displacement limit. Most fluid-powered soft 
robots use open-loop valve sequencing to control body-segment 
actuation. Valve sequencing means that a valve is turned on for some 
duration of time to pressurize the actuator and then turned off to 
either hold or deflate it. Many soft-robotic systems use this control 
approach. Continuously adjustable, variable pressurization 
using a fluidic drive cylinder has been demonstrated. Recent work 
has sought to develop control elements for pneumatic soft robots (for 
example, valves) that do not require electrical control signals. In 
that work, passive valves with memory allow the addressable control 
of many soft-robotic actuators from a single controlled pressure 
source. 
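Open-loop valve sequencing can be pictured with a small simulation. In the sketch below a valve is opened for a fixed duration and then closed so that the actuator holds its pressure. The first-order filling model, the parameter names and the numerical values are assumptions for illustration only; real actuators have nonlinear, geometry-dependent dynamics.

```python
def simulate_valve_sequence(open_time, total_time, supply_pressure,
                            time_constant, dt=0.001):
    """Pressure trace for one open-loop valve pulse.

    The valve opens at t = 0, closes at t = open_time, and the actuator then
    holds its pressure. Filling is modelled as a first-order lag.
    """
    pressure = 0.0
    trace = []
    steps = int(round(total_time / dt))
    for k in range(steps):
        t = k * dt
        valve_open = t < open_time
        if valve_open:
            # Explicit Euler step of dp/dt = (p_supply - p) / tau.
            pressure += dt * (supply_pressure - pressure) / time_constant
        trace.append((t, valve_open, pressure))
    return trace

# Hypothetical numbers: 0.2 s pulse, 50 kPa supply, 0.1 s time constant.
trace = simulate_valve_sequence(0.2, 0.5, 50.0, 0.1)
```

Nothing in this loop measures the resulting shape, which is why the approach is open loop: the pulse duration has to be tuned in advance for each actuator.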

The dynamics of soft robots open the door for new robot capabilities. 
Much like a child on a swing set, repeatable positioning of a 
soft tentacle places it outside of its gravity compensation envelope, 
where the end effector could accomplish tasks. Physical phenomena 
common to soft robots (including actuation limits, the self-loading 
effects of gravity, and the high compliance of the manipulator) 
can be represented as constraints within a trajectory optimization 
algorithm that operates on a dynamic model of the soft robot. One 
example dynamic manoeuvre enables soft robots to interact with 
humans by quickly grabbing objects directly from the hand of a 
human. Dynamic control of soft robots can be achieved using a 
new dynamic model for a spatial soft fluidic elastomer manipula- 
tor, a method for identifying all unknown system parameters (the 
soft manipulator, fluidic actuators and continuous drive cylinders 
or valving), and a planning algorithm that computes locally opti- 
mal dynamic manoeuvres through iterative learning control. This 
approach represents actuation limits, the self-loading effects of 
gravity, and the high compliance of the manipulator, as constraints 
within the optimization. 
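The idea behind iterative learning control, refining an input over repeated executions of the same manoeuvre, can be shown on a toy problem. The sketch below applies a simple proportional learning update to a scalar discrete-time plant; the plant, the gain and the function name are assumptions for illustration and stand in for the far richer manipulator model and constrained optimization described above.

```python
def run_ilc(reference, trials, a=0.3, b=1.0, gain=0.9):
    """P-type iterative learning control on y[t+1] = a*y[t] + b*u[t].

    reference[t] is the desired output one step after input u[t] is applied.
    Returns the learned input and the largest tracking error of each trial.
    """
    horizon = len(reference)
    u = [0.0] * horizon
    peak_errors = []
    for _ in range(trials):
        # Run one trial, always starting from the same initial state.
        y = 0.0
        errors = []
        for t in range(horizon):
            y = a * y + b * u[t]
            errors.append(reference[t] - y)
        peak_errors.append(max(abs(e) for e in errors))
        # Learning update: correct each input with the error it produced.
        u = [ut + gain * et for ut, et in zip(u, errors)]
    return u, peak_errors

reference = [0.1 * t for t in range(1, 21)]
learned_input, peaks = run_ilc(reference, trials=15)
```

With these assumed values the peak error shrinks from one trial to the next; for other plants or gains the update can diverge, so the gain has to be chosen with the plant in mind.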

Dynamic models of hard and semi-soft continuum robots provide 
examples for a variety of control techniques for soft robots. 
Other work has modelled dynamic polymeric pneumatic tubes 
subject to tip loading using a bending beam approximation, but 
this has not been used for control. Tatlicioglu et al. developed a 
dynamic model for, and provided simulations of, a planar extensible 
continuum manipulator using a Lagrangian approach. Luo et al. 
modelled the dynamics of a soft planar snake. 

Figure 4 | Cross-section of common approaches to actuation of soft-robot 
bodies in resting (left) and actuated (right) states. a, Longitudinal tensile 
actuators (for example, tension cables or shape-memory alloy actuators, 
which contract when heated) along a soft-robot arm cause bending when 
activated. b, Transverse tensile actuators cause a soft-robot arm to extend 
when contracted (a muscle arrangement also seen in octopuses). c, 
Pneumatic artificial muscles composed of an elastomeric tube in a woven 
fibre shell. A pressure applied internally causes the tube and shell to expand 
radially, causing longitudinal contraction. d, Fluidic elastic actuator or 
PneuNet design consisting of a pneumatic network of channels in an elastomer 
that expand when filled with a pressurized fluid, causing the soft body to 
bend toward a strain-limited layer (for example, a stiffer rubber or elastomer 
embedded with paper or other tensile fibres). 

Figure 5 | A soft robotic fish. Soft robotic fish physical prototype (top) and 
design schematic. Images from a high-speed video of the fish executing a C-turn 
escape manoeuvre with elapsed time indicated (bottom). 


Systems and applications 

In this section we review some of the systems that have been devel- 
oped so far to address a variety of potential applications to locomo- 
tion, manipulation and human-machine interaction. 


Locomotion 

Recent work has explored the modes of locomotion possible with (or 
enabled by) soft bodies (Fig. 1). Notably, studies of caterpillars have 
led to soft-robotic systems, as well as to a further understanding 
of the control of motion in these animals. An understanding of 
worm biomechanics also led to a bioinspired worm design composed 
of flexible materials and actuated by shape-memory actuators 
(SMAs), and an annelid-inspired robot actuated by dielectric 
elastomer. A European initiative studied the biomechanics and control 
of the octopus to produce a soft-robotic prototype. Likewise, 
a self-contained, autonomous robotic fish actuated by soft fluidic 
actuators could swim forward, turn and adjust its depth (Fig. 5). 
This robot can execute a C-turn escape manoeuvre in 100 milliseconds, 
which is on a par with its biological counterpart, enabling it 
to be used as an instrument (a physical model) for biological studies. 
Soft-robotics projects have also explored quadrupedal locomotion, 
rolling and snake-like undulation. Jumping has also been 
achieved using internal combustion of hydrocarbon fuels to achieve 
rapid energy release. 

Many of the exciting applications for soft robotics (such as search-and-rescue 
operations or environmental monitoring) require an 
autonomous, mobile system. However, most experimental soft-robotic 
systems rely on power and/or control signals delivered 
through pneumatic and/or electrical tethers. Because typical actuation 
power sources (for example, air compressors or batteries) are 
relatively heavy (for example, 1.2 kg), tethers greatly simplify system 
design by significantly reducing the required carrying capacity. 
One approach to achieving mobile systems is to tether a soft robot to 
a mobile rigid robot with a greater carrying capacity. Untethered 
mobile systems have circumvented the challenge of carrying heavy 
power sources by operating underwater or rolling on a surface 
such that actuation system masses do not have to be lifted against 
gravity. Another approach has led to materials and designs tailored 
to operate at working pressures that are high enough to carry large 
payloads. 


Manipulation 

Manipulation is one of the canonical challenges for robotics (Fig. 3). 
Soft systems have a natural advantage over rigid robots in grasping 
and manipulating unknown objects because the compliance of soft 
grippers allows them to adapt to a variety of objects with simple control 
schemes. Grippers that employ isothermal phase change owing 
to granular jamming have taken advantage of this feature of soft systems. 
Under-actuated grippers composed of silicone elastomers 
with embedded pneumatic channels have also shown impressive 
adaptability. Commercially developed systems have also demonstrated 
manipulation with lightweight grippers composed of inflated 
flexible (but not extensible) material. Because manipulation is one of 
the more mature applications of soft-robotic technology, companies have 
begun to produce soft-robotic manipulators (for example, Pneubotics, an 
Otherlab company; Empire Robotics; and Soft Robotics). 


Medical and wearable applications 

One of the natural advantages of soft-robotic systems is the compat- 
ibility of their moduli with those of natural tissues for medical and 
wearable applications. Rigid medical devices or orthoses run the risk 
of causing damage or discomfort to human or animal tissue. In addition, 
it can be difficult to perfectly replicate the motion of natural 
joints with rigid orthotics. One possibility is to integrate a degree 
of compliance into wearable devices, for example for orthopaedic 
rehabilitation. Recently, researchers have begun to look at medical 
and wearable applications for soft robotics, including soft wearable 
input devices (for example, wearable keyboards), soft orthotics for 
ankle-foot rehabilitation, soft sensing suits for lower-limb measurement, 
soft actuated systems for gait rehabilitation of rodents 
that have had their spinal cord surgically cut, and a soft system for 
simulation of cardiac actuation. 


Soft cyborgs 

Recent work has begun to investigate robotic systems that directly 
integrate biological (as opposed to artificial, biologically compatible) 
materials. Because biological materials are often very soft, the result- 
ing systems are soft-robotic systems (or perhaps they would be more 
aptly named soft cyborgs). For example, microbes that digest organic 
material and produce electricity have powered artificial muscles for 
autonomous robots, and cardiomyocytes have been used to power 
a jellyfish-inspired swimming cyborg. One challenge with using 
swarms of inexpensive soft robots for exploration is retrieving the 
robots once the task is completed. One way to avoid this problem is 
to develop biodegradable and soft robots powered by gelatin actuators. 
Because gelatin is edible, there may also be medical applications 
for this technology. 


Future directions 

The field of soft robotics aims to create the science and applications of 
soft autonomy by asking the questions: how do we design and control soft 
machines, and how do we use these new machines? Current research on 
the algorithmic and device-level aspects of soft robots has demonstrated 
soft devices that can locomote, manipulate, and interact with people 
and their environment in unique ways. Soft mobile robots capable of 
locomotion on unknown terrain and soft-robot manipulators capable 
of pose-invariant and shape-invariant grasping rely on compliance to 
mitigate uncertainty and adapt to their environment and task. These 
basic behaviours open the door to applications in which robots work 
closely with humans. For example, a soft-robot manipulator could han- 
dle and prepare a wide variety of food items in the kitchen while a soft 
mobile robot could use contact to guide patients during physiotherapy 
exercises. The soft home robot could assist the elderly with monitoring 
their medicine regimens, whereas a soft factory robot could support 
humans in delicate assembly tasks. But how do we get to the point where 
soft robots deliver on their full potential? We need rapid design tools and 
fabrication recipes for low-cost soft robots, novel algorithmic approaches 
to the control of soft robots that account for their material properties, 
tools for developing device-specific programming environments that 
allow non-experts to use the soft machines, creative individuals to design 
new solutions and early adopters of the technology. 

The soft-robotics community is creating materials that have the ability 
to compute, sense and move, leading to a convergence between materials 
and machines. The materials are becoming more like computers owing 
to embedded electro-mechanical components, and machines are get- 
ting softer owing to the alternative materials used to create them. This 
convergence requires tools for functional specification and automated 
co-design of the soft body (including all the components necessary for 
actuation, sensing and computation) and the soft brain (including all 
software aspects of controlling the device and reasoning about the world 
and the goal task). Progress in new materials with programmable stiff- 
ness properties, proprioceptive sensing, contact modelling, and recipes 
for rapid fabrication will enable the creation of increasingly more capable 
soft machines. 

But how soft should a soft robot be in order to meet its true potential? 
As with many aspects of robotics, this depends on the task and environ- 
ment. For domains that require extreme body compliance (for example, 
inspecting a pipe with complex geometry or laparoscopic surgery) or 
dealing with a great amount of uncertainty (for example, locomotion 
on rocky uneven terrain or grasping unknown objects), soft machines 
can bring the capabilities of robots to new levels. However, soft robots 
are difficult to model and control, especially when they need to retain a 
desired body configuration without external support (for example, an 
elephant trunk holding an object at a desired height in the presence of 
gravity). Augmenting soft machines with skeletons, much like verte- 
brates, could simplify such tasks. An important consideration for the 
future of soft machines is how to match the softness of the body to the 
range of capabilities expected from the machine in a way that most effec- 
tively leverages the body. 

Rapid fabrication recipes and tools for soft robots with embedded 
electronics are needed to expand the user base beyond experts. In addi- 
tion, more advances are needed to expand the computation and control 
capabilities of existing systems. The current approaches to robot control 
rely on external localization. But soft robots that operate autonomously 
need the ability to localize proprioceptively. Current models for soft 
robots do not capture their dynamics. Improved dynamics models will 
lead to more capable controllers. Although the current planners for soft 
robots allow the systems to collide harmlessly with their environments, 
the collisions are not detected. Contact with the environment, however, 
can be a useful aspect of task-level planning. 

Soft robots have the potential to provide a link between living systems 
and artificial systems at multiple levels: high-level tasks, in interactions 
between humans and robots, and in cognition. Pfeifer and Bongard 
have argued that bodies are central to the way that we think. This view — 
embodied artificial intelligence — holds that robotic bodies and brains 
must be considered and developed in concert, much as the bodies and 
brains of their biological counterparts co-evolved. Soft-robotic systems 
have the potential to exploit morphological computation to adapt to, and 
interact with, the world in a way that is difficult or impossible with rigid 
systems. Following the principles of embodied artificial intelligence, soft 
robots may allow us to develop biologically inspired artificial intelligence 
in ways that are not possible with rigid-bodied robots. 


From evolutionary computation to the evolution of things 


Agoston E. Eiben & Jim Smith 


Evolution has provided a source of inspiration for algorithm designers since the birth of computers. The resulting field, 
evolutionary computation, has been successful in solving engineering tasks ranging in outlook from the molecular to the 
astronomical. Today, the field is entering a new phase as evolutionary algorithms that take place in hardware are developed, 
opening up new avenues towards autonomous machines that can adapt to their environment. We discuss how evolution- 
ary computation compares with natural evolution and what its benefits are relative to other computing approaches, and 
we introduce the emerging area of artificial evolution in physical systems. 


The proposal that evolution could be used as a metaphor for problem 
solving came with the invention of the computer. In the 
1970s and 1980s the principal idea was developed into different 
algorithmic implementations under names such as evolutionary programming, 
evolution strategies and genetic algorithms, followed 
later by genetic programming. These branches merged in the 1990s, 
and in the past 20 years so-called evolutionary computation or 
evolutionary computing has proven to be highly successful across a wide 
range of computational tasks in optimization, design and modelling. 
For instance, urgent needs in the development of low-cost thin-film 
photovoltaic technologies were addressed using genetic algorithms 
for topology optimization. This led to highly efficient light-trapping 
structures that exhibited more than a threefold increase over a classic 
limit, and achieved efficiency levels far beyond the reach of intuitive 
designs. It has also been convincingly demonstrated that evolutionary 
approaches are powerful methods for knowledge discovery. For example, 
equations were evolved to model motion-tracking data captured 
from various physical systems, ranging from simple harmonic oscillators 
to chaotic double pendula. This approach discovered, with limited 
prior knowledge of physics, kinematics or geometry, several laws of 
geometric and momentum conservation, and uncovered the ‘alphabet’ 
used to describe those systems. 

From the perspective of the underlying substrate in which the evolu- 
tion takes place, the emergence of evolutionary computation can be 
considered as a major transition of the evolutionary principles from 
‘wetware’, the realm of biology, to software, the realm of computers. 
Today the field is at an exciting stage. New developments in robotics and 
rapid prototyping (3D printing) are paving the way towards a second 
major transition: from software to hardware, going from digital 
evolutionary systems to physical ones. 

In this Review we outline the working principles of evolutionary algo- 
rithms, and briefly discuss the differences between artificial and natural 
evolution. We illustrate the power of evolutionary problem solving by 
discussing a number of successful applications, reflect on the features 
that make evolutionary algorithms so successful, review the current 
trends of the field, and give our perspective on future developments. 


Evolution and problem solving 

The essence of an evolutionary approach to solve a problem is to equate 
possible solutions to individuals in a population, and to introduce a 
notion of fitness on the basis of solution quality. To obtain a working 
evolutionary algorithm one has to go through a number of design steps. 
The first step is to identify a representation: a suitable data structure 
that can represent possible solutions to the problem. The next step is to 
define a way of measuring the quality of an individual based on prob- 
lem-specific requirements. The final step is to specify suitable selection 
and variation operators (Fig. 1). 

Analogous to natural evolution, an evolutionary algorithm can be 
thought of as working on two levels. At the higher level (the original 
problem context), phenotypes (candidate solutions) have their fitness 
measured. Selection mechanisms then use this measure to choose a pool 
of parents for each generation, and decide which parents and offspring 
go forward to the next generation. At the lower level, genotypes are 
objects that represent phenotypes in a form that can be manipulated 
to produce variations (Box 1). Genotype-phenotype mapping bridges 
the two levels. At the genotypic level, variation operators generate new 
individuals (offspring) from selected parents. Mutation operators are 
based on one parent (asexual reproduction) and randomly change some 
values. Recombination operators create offspring by combining values 
from the genotypes of two (or more) parents. Finally, an execution man- 
ager controls the overall functioning of the algorithm. It regulates the 
initialization of the first population, the execution of the selection-variation 
cycles, and the termination of the algorithm. It also manages the 
population size (typically kept constant) and other parameters affect- 
ing selection and variation. For instance, it determines the number of 
parents per generation, and whether mutation, recombination or both 
produce the offspring for a given set of parents. 
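The components described above can be assembled into a very small evolutionary algorithm. The sketch below is one possible instance, not a canonical one: the bitstring genotype, the identity genotype-phenotype mapping, binary tournament selection, one-point crossover, the mutation rate and the fixed generation budget are all assumptions chosen to keep the example short.

```python
import random

def evolve(fitness, genome_length, pop_size=20, generations=50, seed=0):
    """Minimal generational evolutionary algorithm on bitstring genotypes."""
    rng = random.Random(seed)

    # Initialization: a population of random bitstrings.
    population = [[rng.randint(0, 1) for _ in range(genome_length)]
                  for _ in range(pop_size)]
    best_per_generation = []

    # Termination: a fixed number of selection-variation cycles.
    for _ in range(generations):
        offspring = []
        for _ in range(pop_size):
            # Parent selection: two binary tournaments on fitness.
            first = max(rng.sample(population, 2), key=fitness)
            second = max(rng.sample(population, 2), key=fitness)
            # Recombination: one-point crossover of the two genotypes.
            cut = rng.randint(1, genome_length - 1)
            child = first[:cut] + second[cut:]
            # Mutation: flip each bit independently with probability 1/length.
            child = [1 - g if rng.random() < 1.0 / genome_length else g
                     for g in child]
            offspring.append(child)
        # Survivor selection: keep the fittest pop_size of parents plus
        # offspring, so the population size stays constant.
        population = sorted(population + offspring, key=fitness,
                            reverse=True)[:pop_size]
        best_per_generation.append(fitness(population[0]))
    return population[0], best_per_generation

# Toy fitness: the number of ones in the bitstring.
best, history = evolve(fitness=sum, genome_length=30)
```

Only the fitness function and the meaning given to the bits are problem specific here; the loop itself would be reused unchanged for another bitstring-encoded problem.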

Evolutionary algorithms are easily transferable from one application 
to another because only two components are problem dependent: the 
way that the genotypes are converted to phenotypes and the fitness func- 
tion. The history of evolutionary computation has shown that suitable 
combinations of a few simple data structures can represent possible 
solutions to a huge variety of different problems (Box 1). In other words, 
a relatively small collection of possible genotypes can accommodate 
many different kinds of phenotypes. Just as the genetic mechanisms 
underpinning natural evolution are largely species independent, acting 
on DNA or RNA, so too in evolutionary computation the choice of suit- 
able variation operators depends solely on the data structure present in 
the genotypes and not on the specific problem being tackled. Selection 
operators do not even depend on the chosen representation, as they only 
consider fitness information. This implies that for a certain problem a 
suitable evolutionary algorithm can be designed easily, as long as the 
problem-dependent phenotypes can be mapped to one of the ‘standard’ 
genotypes. From that point on, freely available evolutionary algorithm 
machinery can be used. 
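To illustrate how variation operators depend only on the data structure of the genotype, the following sketch gives one mutation operator for each of three standard genotypes described in Box 1. The function names and default parameters are hypothetical, and the Gaussian perturbation is an assumed choice for the "small random number" added to real values.

```python
import random

def mutate_bitstring(genotype, rng, rate=None):
    """Flip each bit independently; the default rate is 1/len(genotype)."""
    rate = 1.0 / len(genotype) if rate is None else rate
    return [1 - bit if rng.random() < rate else bit for bit in genotype]

def mutate_permutation(genotype, rng):
    """Swap the values at two randomly selected positions."""
    i, j = rng.sample(range(len(genotype)), 2)
    child = list(genotype)
    child[i], child[j] = child[j], child[i]
    return child

def mutate_real_vector(genotype, rng, sigma=0.1):
    """Perturb every value by adding a small Gaussian random number."""
    return [value + rng.gauss(0.0, sigma) for value in genotype]

rng = random.Random(1)
route = mutate_permutation(list("abcdefghijk"), rng)
```

Each operator returns a child of the same data type as its parent; the swap, for instance, always yields a valid permutation, whatever routing or scheduling problem the permutation encodes.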

It should be noted that just because an algorithm is formally suit- 
able, it does not necessarily mean it will be successful. Suitability only 
means that the evolutionary algorithm is capable of searching through 
the space of possible solutions of a problem, but gives no guarantees that 
this search will be either effective or efficient. 


Positioning of evolutionary computation 

From a historical perspective, humans have had two roles in evolu- 
tion. Just like any other species, humans are the product of, and are 
subject to, evolution. But for millennia (in fact, for about twice as 
long as we have used wheels) people have also actively influenced 
the course of evolution in other species — by choosing which plants 
or animals should survive or mate. Thus humans have successfully 
exploited evolution to create improved food sources or more useful 
animals, even though the mechanisms involved in the transmission 
of traits from one generation to the next were not understood. 

Historically, the scope of human influence in evolution was very 
limited, being restricted to interfering with selection for survival and 
reproduction. Influencing other components, such as the design of 
genotypes, or mutation and recombination mechanisms, was far 
beyond our reach. This changed with the invention of the computer, 
which provided the possibility of creating digital worlds that are 
very flexible and much more controllable than the physical reality 
we live in. Together with the increased understanding of the genetic 
mechanisms behind evolution, this brought about the opportunity 
to become active masters of evolutionary processes that are fully 
designed and executed by human experimenters ‘from above’. 

It could be argued that evolutionary algorithms are not faithful 
models of natural evolution (Table 1). However, they certainly are a 
form of evolution. As Dennett said, “If you have variation, heredity, 
and selection, then you must get evolution”. 

Figure 1 | The principal diagram of evolutionary algorithms, with stages 
labelled initialization, variation, selection and termination. The 
initialization process seeds the search with a population of randomly 
created solutions. After this the algorithm enters a loop of evaluating the 
current generation of solutions, selecting some to act as the basis for the next 
generation, and then creating new solutions through variation (mutation or 
crossover). Periodically, the algorithm checks to see whether user-specified 
termination criteria are met, such as reaching a desired level of fitness or 
undergoing a certain number of generations without improvement. 

From a computer-science perspective, evolutionary algorithms are 
randomized heuristic search methods based on the generate-and-test 
principle: producing an offspring amounts to generating a new point 
in the search space, and testing is done through fitness evaluation. 
What distinguishes evolutionary algorithms from other algorithms 
in computer science is the unique combination of being stochastic 
and maintaining their working memory in the form of a population 
of candidate solutions. It should be noted that there are many varia- 
tions of the generic evolutionary computation template under various 
names. Today, the family of evolutionary algorithms includes histori- 
cal members: genetic algorithms, evolution strategies, evolutionary 
programming and genetic programming; and younger siblings, such 
as differential evolution and particle swarm optimization. These 
differ in some details, terminology or motivational metaphor, but are, 
in essence, all instances of the same algorithmic template. 


BOX 1 

Representing solutions to problems as genotypes 

Genotypes are objects that represent phenotypes (candidate solutions to 
problems) in a form that can be manipulated to produce variations. 
Examples of data structures frequently used as genotypes are shown 
in the Figure. One suitable mutation operator is shown for each, with 
its action shown by red arrows. Note that the mutation operator must 
deliver a child of the same data type; for example, a valid mutation 
operator for permutations must result in a valid permutation. Complex 
problems might require complex genotypes with appropriate mutation 
operators. 

Bitstrings are the natural choice for problems for which solutions 
are composed of on/off or true/false decisions. The most commonly 
used mutation operator makes an independent choice in each position 
whether to invert the bit value. 

Permutations can be used when the problem involves ordering a set of 
entities, such as in routing or scheduling. One simple mutation operator 
swaps the values in two randomly selected locations. 

Real-valued vectors can capture continuous optimization problems, 
for example where the variables represent quantities such as dimensions 
or mass. Typically, the mutation operator perturbs each value by adding 
a (small) random number. 

Trees are branching data structures suitable for representing 
equations, logical expressions or program code. A common mutation 
operator selects a node at random, and replaces the subtree below with a 
new, randomly generated one. 

Box 1 figure | Bitstrings: parent 1110011000, child 1010011001. 
Permutations: parent abcdefghijk, child agcdefbhijk. Real-valued 
vectors: child 1.1 -0.6 3.3 4.0 9.0. 


Table 1 | Main differences between natural evolution and evolutionary algorithms 

Fitness 
Natural evolution: Observed quantity: a posteriori effect of selection and 
reproduction (‘in the eye of the observer’). 

Selection 
Natural evolution: Complex multifactor force based on environmental 
conditions, other individuals of the same species and those of other species 
(predators). Viability is tested continually; reproducibility is tested at 
discrete times. 

Genotype-phenotype mapping 
Natural evolution: Highly complex biochemical and developmental process 
influenced by the environment. 

Variation 
Natural evolution: Offspring are created from one (asexual reproduction) or 
two parents (sexual reproduction). Horizontal gene transfer can accumulate 
genes from more individuals. 

Execution 
Natural evolution: Parallel, decentralized execution; birth and death events 
are not synchronized. 

Population 
Natural evolution: Spatial embedding implies structured populations. 
Population size varies according to the relative number of birth and death 
events. Populations can and do go extinct. 

It is common to categorize algorithms according to completeness 
(can they generate every possible solution), optimality (are they guar- 
anteed to find the best solution, and identify it as such) and efficiency. 
The completeness of an evolutionary algorithm can be achieved by 
an appropriate choice of representation and variation operators. 
The optimality is a more complex issue. Although optimal methods 
exist for many problems, their run time scales so poorly that they 
are impractical to use in most non-trivial cases — hence the inter- 
est in heuristic methods. As long as the heredity principle (similar 
individuals have similar fitness) holds, an evolutionary algorithm 
will have a ‘basic instinct’ to improve the populations fitness over 
time — because the selection operators are biased towards choos- 
ing fitter individuals for reproduction and survival. Thus, if we can 
define artificial fitness on the basis of a criterion grounded in the 
problem to be solved then the evolutionary algorithm will tend to 
find solutions that optimize the fitness values, or at least approximate 
them. This implies that evolutionary algorithms can be used to solve 
optimization problems and, consequently, any problem that can be 
transformed into an equivalent optimization task. This includes most 
problems in design, and those connected with building or learning 
models from data. Nevertheless, it is important to understand that 
evolutionary algorithms are not optimizers”, but approximators, and 
they are not optimal since we might not know whether the fitness of 
the best evolved solution is in fact the highest value possible. Yet, they 
become very interesting when approximate solutions are acceptable, 
for instance, if the global optimum is not known or not required. 


Applications of evolutionary computation 

The hypothesis that embedding the principles of evolution within com- 
puter algorithms can create powerful mechanisms for solving difficult, 
poorly understood problems is now supported by a huge body of evi- 
dence. Evolutionary problem solvers have proven capable of delivering 
high-quality solutions to difficult problems in a variety of scientific and 
technical domains, offering several advantages over conventional opti- 
mization and design methods. 

One appealing example from the design domain concerns X-band 
antennas for the NASA Space Technology 5 (ST5) spacecraft”. The 
normal approach to this task is very time and labour intensive, rely- 
ing heavily on expert knowledge. The evolutionary-algorithm-based 
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Predefined a priori quantity that drives selection and reproduction. 


Randomized operator with selection probabilities based on given fitness 
values. Survivor selection and parent selection both happen at discrete times. 


Typically a simple mathematical transformation or parameterized procedure. A 
few systems use generative and developmental genotype-phenotype maps. 


Unconstrained vertical gene transfer. Offspring may be generated from any 
number of parents: one, two or many. 


Typically centralized with synchronized birth and death. 


Typically unstructured and panmictic (all individuals are potential partners). 
Population size is usually kept constant by synchronizing time and number of 
birth and death events. 


approach not only discovered effective antenna designs, but could also 
adjust designs quickly when requirements changed. One of these anten- 
nas was actually constructed and deployed on the ST5 spacecraft, thus 
becoming the first computer-evolved hardware in space. This project 
also demonstrates a specific advantage of evolutionary over manual 
design. The evolutionary algorithms generated and tested thousands 
of completely new solutions, many with unusual structures that expert 
antenna designers would be unlikely to produce. Evolutionary algo- 
rithms have also been successful in many other aeronautical and aero- 
space engineering endeavours. Problems in this field typically have 
highly complex search spaces and multiple conflicting objectives. Pop- 
ulation-based methods such as evolutionary algorithms have proven 
effective at meeting the challenges of this combination. In particular, 
so-called multi-objective evolutionary algorithms change the selection 
function to explicitly reward diversity, so that they discover and main- 
tain high-quality solutions representing different trade-offs between 
objectives — technically, they approximate diverse segments of the 
Pareto front. Many examples can also be found in bioinformatics. For
instance, by mining the ChEMBL database (which contains bioactive 
molecules with drug-like properties), a set of transformations of chemical
structures was identified and then used as the mutation operator
in an automated drug-design application. The results showed clear
benefits, particularly in accommodating multiple target profiles such as 
desired polypharmacology. This nicely illustrates how other approaches, 
or existing knowledge, can be easily co-opted or accommodated within 
an evolutionary computing framework. 
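The idea of a Pareto front mentioned above can be made concrete with a small Python sketch. It shows only the dominance test and the filtering of non-dominated solutions, assuming every objective is to be minimized. It is not a full multi-objective evolutionary algorithm, and the objective pairs are invented for illustration.

```python
def dominates(a, b):
    """True if `a` is no worse than `b` on every objective and strictly
    better on at least one (all objectives are minimized)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))

def non_dominated(points):
    """Keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

# Hypothetical (cost, weight) pairs for seven candidate designs.
candidates = [(1, 9), (2, 7), (3, 8), (4, 4), (6, 5), (7, 2), (8, 3)]
front = non_dominated(candidates)
print(front)  # [(1, 9), (2, 7), (4, 4), (7, 2)]
```

The four surviving points represent different trade-offs between the two objectives. A multi-objective evolutionary algorithm would add a selection scheme that also rewards spreading the population along this front.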

Numerical and combinatorial optimization are important applica- 
tion areas of evolutionary algorithms. Particularly challenging is black- 
box optimization, where the nature of the objective function requires 
numerical (rather than analytical) methods, and gradient informa- 
tion can only be approximated by sampling solutions. A systematic 
experimental study compared mathematical programming and evolutionary
computation on a range of synthetic black-box optimization
problems, under different budgets of computing time
and resources. The results showed that mathematical programming
algorithms — that were designed to provide quick progress in the ini- 
tial stages of the search — outperform evolutionary algorithms if the 
maximum number of evaluations is low, but this picture changes if the 
computational budget is increased. Ultimately, the study concludes that 
an evolutionary algorithm, in particular BIPOP-CMA-ES, is able to 
find the optimum of a broader class of functions, solve problems with 
a higher precision and solve some problems faster. The power of evolution
strategies (especially the very successful CMA-ES variants) for
real-life black-box optimization problems from industry has been discussed
extensively. Evidence gathered from years of academic research
and development for industrial applications suggests that the niche for 
evolution strategies is formed by optimization tasks with a very lim- 
ited budget for how many solutions can have their fitness evaluated. 
Although this finding is not in line with conventional wisdom within 
the field, there is ample support for this proposal. 
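To give a feel for how an evolution strategy treats the objective as a black box, here is a minimal Python sketch of a (1+1) evolution strategy with the classic one-fifth success rule for step-size adaptation. This is a deliberately simpler relative of CMA-ES, not CMA-ES or BIPOP-CMA-ES itself. The test function (`sphere`), the starting point, the adaptation factor of 1.5 and the budget are assumptions made for illustration.

```python
import random

def sphere(x):
    """Toy black-box objective to minimize; only function values are used."""
    return sum(v * v for v in x)

def one_plus_one_es(f, x0, sigma=1.0, budget=2000, seed=0):
    """(1+1)-ES: one parent, one Gaussian-mutated child per evaluation."""
    rng = random.Random(seed)
    parent, f_parent = list(x0), f(x0)
    for _ in range(budget):
        child = [v + sigma * rng.gauss(0.0, 1.0) for v in parent]
        f_child = f(child)
        if f_child <= f_parent:
            parent, f_parent = child, f_child
            sigma *= 1.5            # success: try larger steps
        else:
            sigma *= 1.5 ** -0.25   # failure: shrink steps
    return parent, f_parent

best, best_f = one_plus_one_es(sphere, [3.0] * 5)
print(best_f)
```

With these two factors the step size is stationary when about one mutation in five succeeds, which is the one-fifth rule. No gradient is ever computed, and the `budget` parameter is the number of fitness evaluations, the quantity that the paragraph above identifies as the limiting resource.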

Machine learning and modelling is another prominent area in which 
evolutionary algorithms have proved their power, especially as many 
contemporary approaches would otherwise rely on (often crude) greedy 
or local search algorithms to refine and optimize models. For example, 
neuroevolutionary approaches use evolutionary algorithms to optimize 
the structure, parameters, or both simultaneously, of artificial neural 
networks. In other branches of machine learning, using evolutionary
computing to design algorithms has been shown to be very effective as 
an alternative to handcrafting them, for instance, for inducing decision 
trees. Furthermore, evolutionary algorithms have been applied to pre- 
diction problems. For instance, to tackle the problem of predicting the 
tertiary structure of a protein, an algorithm was designed to evolve a key 
component of automated predictors — the function used to estimate a 
structure’s energy. State-of-the-art methods in protein-structure prediction
are limited by assuming a linear combination of energy terms,
whereas a genetic programming method easily accommodates expres- 
sions based on a much richer syntax. The best energy function found 
by the genetic programming algorithm provided significantly better 
prediction guidance than conventional functions. The algorithm was 
able to automatically discover the most and least useful energy terms, 
without having any knowledge of how these terms alone are correlated 
to the prediction error. 

The design of controllers for physical entities, such as machinery 
or robots, has proved to be another fruitful area. For example, control 
strategies for operating container cranes were evolved using a physical
crane to determine fitness values. The evolution of controllers is
also possible in situ, for example in a population of robots during, and 
not just before, their operational period. Evolutionary robotics is
an especially challenging application area because of two additional 
issues that other branches of evolutionary computing do not face: the 
very weak and noisy link between controllable design details and the 
target feature or features; and the great variety of conditions under 
which a solution should perform well. Normally in evolutionary com- 
puting there is a three-step evaluation chain: genotype to phenotype 
to fitness. For robots the chain is four-step: genotype to phenotype to 
behaviour to fitness. In this four-step chain the robot’s morphology
and controller form the phenotype. However, it could be argued that 
the behaviour should be considered as phenotype, because it is the 
entity that is being evaluated. Furthermore, the behaviour depends 
on many external factors, creating an unpredictable environment in 
which the robot is expected to perform. Nevertheless, since the manual 
design of an autonomous and adaptive mobile robot is extremely dif- 
ficult, evolutionary approaches offer large potential benefits. These 
include the possibility of continuous and automated design, manu- 
facture and deployment of robots of very different morphologies and 
control systems. Several studies have demonstrated such benefits,
in which robot control systems that were automatically generated by 
artificial evolution were comparatively simpler or more efficient than 
those engineered using other design methods. In all cases, robots
initially exhibited uncoordinated behaviour, but a few hundred
generations were sufficient to achieve efficient behaviours in a wide 
range of experimental conditions. 

Figure 2 | Two major transitions in the history of artificial evolution. In
the twentieth century computer technology enabled artificial Darwinian
processes in silico: the evolution of digital entities. In the twenty-first
century, developments in robotics, materials science and 3D printing will
enable the evolution of physical artefacts or machines.


Several state-of-the-art algorithms for applications across a great variety
of problem domains are based on hybridizing evolutionary search
with existing algorithms, especially local search methods. This kind of
hybridization can be thought of as adding ‘lifetime learning’ to the
evolutionary process. Freed from the restrictions of natural evolution (such as
learned traits not being written back immediately to the genotype), and
being able to experiment with novel types of individual and social learning,
the theory and practice of so-called memetic algorithms has become
an important topic in the field. Such hybrid algorithms can often find
good (or better) solutions faster than a pure evolutionary algorithm when
the additional method searches systematically in the vicinity of good
solutions, rather than relying on the more randomized search carried
out by mutation. For example, the cell suppression problem (deciding
which data cells to disclose in published statistical tables in order to
protect respondents’ confidentiality) was solved using a combination of
graph partitioning, linear programming and evolutionary optimization
of the sequence in which vulnerable cells were considered. This produced
methods that could protect published statistical tables at a size that was
several orders of magnitude greater than had previously been possible.
Memetic algorithms have obtained an eminent place among the best
approaches to solving really hard problems.
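The hybridization described above can be sketched in a few lines of Python. In this minimal memetic algorithm every offspring is improved by a short local search before it competes for survival, and the improved genotype is kept (the learned trait is written back). The toy objective (counting ones in a bitstring), the operators and all parameter values are assumptions for illustration, not the method used for the cell suppression problem.

```python
import random

def fitness(bits):
    """Toy objective (OneMax): number of ones. Stands in for a real problem."""
    return sum(bits)

def local_search(bits, rng, tries=10):
    """'Lifetime learning': try single bit flips, keep those that do not hurt."""
    best, best_fit = bits[:], fitness(bits)
    for _ in range(tries):
        cand = best[:]
        cand[rng.randrange(len(cand))] ^= 1
        cand_fit = fitness(cand)
        if cand_fit >= best_fit:
            best, best_fit = cand, cand_fit
    return best

def memetic_search(n_bits=40, pop_size=20, generations=30, seed=1):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        offspring = []
        for _ in range(pop_size):
            # Parent selection: two binary tournaments.
            mum = max(rng.sample(pop, 2), key=fitness)
            dad = max(rng.sample(pop, 2), key=fitness)
            # Variation: uniform crossover, then bit-flip mutation.
            child = [rng.choice(pair) for pair in zip(mum, dad)]
            child = [b ^ 1 if rng.random() < 1.0 / n_bits else b for b in child]
            # Local search refines the child; the result replaces its genotype.
            offspring.append(local_search(child, rng))
        # Survivor selection: keep the best of parents and offspring.
        pop = sorted(pop + offspring, key=fitness, reverse=True)[:pop_size]
    return pop[0]

best = memetic_search()
print(fitness(best), "of", len(best))
```

Removing the `local_search` call turns this back into a plain evolutionary algorithm, which makes it easy to compare the two on the same problem.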


State of the art 

Although initially considerable scepticism surrounded evolution- 
ary algorithms, over the past 20 years evolutionary computation has 
grown to become a major field in computational intelligence. As well
as solving hard problems in various application areas, the emphasis of 
evolutionary algorithms on randomness as a source of variation has 
been shown to have particular advantages: the lack of problem-specific 
preconceptions and biases of the algorithm designer opens up the way 
to unexpected ‘original’ solutions that can even have artistic value
(http://endlessforms.com/). The perception of evolution as a problem 
solver has broadened from seeing evolution as a heuristic algorithm for 
(parametric) optimization to considering it to be a powerful approach 
for (structural) design.

In general, evolutionary algorithms have proven competitive in 
solving hard problems in the face of challenging characteristics like 
non-differentiability, discontinuities, multiple local optima, noise and 
nonlinear interactions among the variables, especially if the computa- 
tional budgets are sufficiently high. Evolution is a slow learner, but the 
steady increase in computing power, and the fact that the algorithm is 
inherently suited to parallelization, mean that more and more genera- 
tions can be executed within practically acceptable timescales. 

The performance of evolutionary algorithms has also been com- 
pared with that of human experts, and there is now substantial and 
well-documented evidence of evolutionary algorithms producing measurably
human-competitive results. The annual Humies competition
(http://www.genetic-programming.org/combined.php), which rewards 
human-competitive results from evolutionary computation, highlights 
the great variety of hard problems for which evolutionary algorithms 
have delivered excellent solutions. 

The success and popularity of evolutionary algorithms can be attributed
to a number of algorithmic features that make them attractive.
First, they are assumption free because applying an evolutionary algo- 
rithm consists of specifying the representation for candidate solutions 
and providing an external function that first transforms the genotype 
into a candidate solution and then provides an evaluation. Internally, 
evolutionary algorithms make no explicit assumptions about the
problem, hence they are widely applicable and easily transferable at 
low cost. Second, they are flexible because they can be easily used in 
collaboration with existing methods, such as local search; they can be 
incorporated within, or make use of, existing toolsets; and combina- 
tions with domain-specific methods often lead to superior solvers 
because they can exploit the best features of different approaches. 
Third, they are robust, owing to the use of a population and rand- 
omized choices, which mean evolutionary algorithms are less likely to 
get trapped in suboptimal solutions than other search methods. They 
are also less sensitive to noise or infidelity in the models of the system 
used to evaluate solutions, and can cope with changes in the problem. 
Fourth, they are not focussed on a single solution because having 
a population means that an algorithm terminates with a number of 
solutions. Thus, users do not have to pre-specify their preferences and 
weighting in advance, but can make decisions after they see what is 
possible to achieve. This is a great advantage for problems with many 
local optima, or with a number of conflicting objectives. Finally, they 
are capable of producing unexpected solutions because they are blind 
to human preconceptions and so can find effective, but non-intuitive 
solutions, which are often valuable in design domains. 
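The first and fourth features can be illustrated with a minimal Python sketch of the generic template. The `evolve` function knows nothing about the problem: the user supplies a representation (`random_genotype`, `mutate`) and an external function that first decodes the genotype and then evaluates it (`decode`, `evaluate`). All names, the toy two-parameter problem and the parameter values are assumptions for illustration.

```python
import random

def evolve(random_genotype, mutate, decode, evaluate,
           pop_size=20, generations=50, seed=0):
    """Problem-independent loop; returns the whole final population."""
    rng = random.Random(seed)
    score = lambda g: evaluate(decode(g))
    pop = [random_genotype(rng) for _ in range(pop_size)]
    for _ in range(generations):
        # Parent selection: binary tournaments biased towards fitter individuals.
        parents = [max(rng.sample(pop, 2), key=score) for _ in range(pop_size)]
        offspring = [mutate(p, rng) for p in parents]
        # Survivor selection: keep the best of parents and offspring.
        pop = sorted(pop + offspring, key=score, reverse=True)[:pop_size]
    return pop

# Problem-specific part (hypothetical): genotypes live in the unit square,
# phenotypes are two parameters in [-5, 5], fitness peaks at (1, -2).
def random_genotype(rng):
    return [rng.random(), rng.random()]

def mutate(genotype, rng):
    return [min(1.0, max(0.0, v + rng.gauss(0.0, 0.03))) for v in genotype]

def decode(genotype):
    return [10.0 * v - 5.0 for v in genotype]

def evaluate(phenotype):
    x, y = phenotype
    return -((x - 1.0) ** 2 + (y + 2.0) ** 2)

final_pop = evolve(random_genotype, mutate, decode, evaluate)
best = decode(final_pop[0])
print(best, evaluate(best))
```

Because `evolve` returns the entire final population rather than a single winner, a user can inspect several good candidates before committing to one.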

The theoretical underpinning of evolutionary algorithms remains a 
hard nut to crack. Mathematical analysis can illuminate some properties, 
but even digital evolutionary processes exhibit very complex dynamics 
that allow only limited theory forming, despite the diverse set of tools 
and methods ranging from quantitative genetics to statistical physics.
One important theoretical result is the no free lunch theorem. This states 
that evolutionary algorithms are not generic super solvers — but neither 
is any other method, because there is no such thing. Instead, “an evolutionary
algorithm is the second best solver for any problem”, meaning
that in many cases a carefully hand-crafted solver that exploits problem 
characteristics is superior for the problem at hand, but that it might take 
years to create that solver. A long-standing issue for theorists is algorithm 
convergence. Early results were based on Markov-chain analysis and 
addressed convergence in general, but more recent work found specific
relationships between algorithmic set-up and expected run times.
Despite all the difficulties, the field is making progress in theory.


Important research trends 
The development of evolutionary computation continues along a num- 
ber of research threads. 


Automated design and tuning of evolutionary algorithms 
Experience has shown that there are several design choices behind an 
evolutionary algorithm that greatly influence its performance. To reduce 
the number of design decisions to be made, and the impact of poor 
choices, the community is working on automated design aids. These can 
customize an initial algorithm set-up for a given problem offline (before 
the run) or online (during the run). Techniques such as automated
parameter tuning and adaptive parameter control continue to make
advances in this area.


Using surrogate models 

Increasingly, evolutionary algorithms are being used for problems in 
which evaluating each population member over many generations 
would take too long to permit effective evolution given the resources 
available. A range of approaches — collectively known as surrogate 
models — are being developed that use computationally cheaper models 
in place of full fitness evaluations, and that refine those models through 
occasional full evaluations of targeted individuals.
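A minimal Python sketch of the surrogate idea follows. Many candidates are generated in each generation, a cheap model ranks them, and only the few most promising receive a full evaluation, whose result is then added to the data that the cheap model uses. The nearest-neighbour surrogate, the stand-in `expensive_fitness` function and all parameter values are assumptions for illustration; practical systems use a range of other models.

```python
import random

def expensive_fitness(x):
    """Stand-in for a costly simulation; maximized at (0.5, 0.5, 0.5)."""
    return -sum((v - 0.5) ** 2 for v in x)

def surrogate(x, archive, k=3):
    """Cheap estimate: mean fitness of the k nearest fully evaluated points."""
    def squared_distance(record):
        return sum((a - b) ** 2 for a, b in zip(record[0], x))
    nearest = sorted(archive, key=squared_distance)[:k]
    return sum(f for _, f in nearest) / len(nearest)

def surrogate_assisted_search(dim=3, generations=20, candidates=30,
                              full_evals=3, seed=0):
    rng = random.Random(seed)
    archive = []  # (solution, true fitness) pairs from full evaluations
    for _ in range(5):
        x = [rng.random() for _ in range(dim)]
        archive.append((x, expensive_fitness(x)))
    for _ in range(generations):
        parent = max(archive, key=lambda record: record[1])[0]
        pool = [[v + rng.gauss(0.0, 0.1) for v in parent]
                for _ in range(candidates)]
        # Rank all candidates with the cheap model only.
        pool.sort(key=lambda x: surrogate(x, archive), reverse=True)
        # Fully evaluate the most promising few; this also refines the model.
        for x in pool[:full_evals]:
            archive.append((x, expensive_fitness(x)))
    return max(archive, key=lambda record: record[1]), len(archive)

(best_x, best_f), n_full = surrogate_assisted_search()
print(best_f, "after", n_full, "full evaluations")
```

With these settings 600 candidates are screened but only 65 full evaluations are spent (5 initial plus 3 per generation).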


Handling many objectives 

Evolutionary algorithms have proven highly successful at finding solutions
to problems with multiple objectives (typically up to ten), and the
community is now making rapid advances in the field of many objectives,
moving way beyond the capabilities of other algorithms. In tandem with
algorithmic advances, this has spurred renewed interest in interactive
evolutionary algorithms, which have been successfully applied to
elicit user preferences and knowledge in many areas from design to
art (http://picbreeder.org). Results suggest a useful synergy with periodic
user interaction to incorporate preferences that help to focus the
search down to a more manageable set of dimensions. Importantly,
this involves eliciting user preferences in response to what is discovered
to be possible, rather than a priori.


Generative and developmental representations 

Further to the conventionally simple genotype-phenotype mappings, 
the use of indirect encodings is gaining traction. Such generative 
and developmental representations allow the reuse of code, which 
helps to scale up the complexity of artificially evolved phenotypes, for 
instance, in evolutionary robotics, artificial life and morphogenetic 
engineering.
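One simple way to see how an indirect encoding reuses code is a rewriting system in the style of an L-system, sketched here in Python. The genotype is a small set of rewrite rules, and the phenotype is the string obtained by applying them repeatedly. The two-rule example is a textbook L-system chosen for illustration, not an encoding taken from the cited work.

```python
def develop(axiom, rules, steps):
    """Grow a phenotype by rewriting every symbol in parallel, `steps` times."""
    phenotype = axiom
    for _ in range(steps):
        phenotype = "".join(rules.get(symbol, symbol) for symbol in phenotype)
    return phenotype

genotype = {"A": "AB", "B": "A"}   # two rules are the whole genotype
print(develop("A", genotype, 5))   # ABAABABAABAAB

mutant = {"A": "AB", "B": "AA"}    # changing one rule alters the entire form
print(develop("A", mutant, 5))
```

The genotype stays two rules long while the phenotype grows with every development step, which is the scaling property that makes such encodings attractive for complex evolved structures.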


Outlook 

The range of problems to which evolutionary algorithms have been 
successfully applied has grown year on year, and there is every reason 
to expect this to continue. In the future, we expect to see increasing 
interest in applying evolutionary algorithms to embodied or embed- 
ded systems; that is, employing evolution in populations for which the 
candidate solutions are controllers or drivers that implement the opera- 
tional strategy for some situated entities, and are evaluated within the 
context of some rich dynamic environment; not for what they are, but 
for what they do. Examples include policies for Web-crawlers, infor- 
mation retrieval strategies, software for machinery and smart devices, 
and controllers for autonomous robots. In such cases the evolved
solutions are embedded in entities that exist and act in a ‘habitat’, such as the
internet or the physical world, that is too complex and dynamic to be
modelled perfectly. Enhancing the system with the ability to evolve 
and adapt after deployment can complement the offline optimization 
approach employed during the design stage. The novelty of such systems 
is that evolutionary changes take place within the operational period. 
These systems will be different because they replace the conventional 
design-and-deploy approach by a design-deploy-adapt loop in which
the evolutionary component is a principal part of the system. 

This approach is already gaining traction in two areas. In the field of 
search-based software engineering, evolutionary algorithms are gain- 
ing prominence in response to the mismatch between the availability of 
expert software engineers and the explosion of interconnected devices 
requiring new and/or updated software. Meanwhile, recent developments
in rapid fabrication technologies (3D printing) and ever smaller
and more powerful robotic platforms mean that evolutionary comput- 
ing is now starting to make the next major transition to the automated 
creation of physical artefacts and ‘smart’ objects (Fig. 2). In the long
term this could lead to a disruptive robotic technology in which design 
and production are replaced by selection and reproduction without the 
involvement of human designers and human-operated facilities. 

Last but not least, we foresee a fruitful cross-fertilization with biol- 
ogy in the coming decade based on a bidirectional flow of inspira- 
tion, understanding and knowledge. On the one hand, the advancing 
insights in molecular and evolutionary biology can be used to make 
more sophisticated evolutionary algorithms and may help to solve pre- 
viously intractable problems. The opportunities and challenges of this 
avenue have been outlined in a research agenda to transform artificial 
evolution to computational evolution. On the other hand, a new kind
of artificial evolution — the evolution of things — opens new horizons 
for biologists. In 1992, the evolutionary biologist John Maynard Smith 
commented: “So far, we have been able to study only one evolving sys- 
tem and we cannot wait for interstellar flight to provide us with a second. 
If we want to discover generalizations about evolving systems, we will 
have to look at artificial ones”. Artificial evolution implemented on
real hardware, as in evolutionary robotics, offers a new research instrument
to this end. The use of real hardware overcomes the principal
deficiency of software models, which lack the richness of matter that is
a source of challenges and opportunities not yet matched in artificial
algorithms. Hence, hardware-based experiments can provide new insights into fundamental
issues such as the factors influencing evolvability, resilience, the rate of 
progress under various circumstances, or the co-evolution of mind and 
body. Using a non-biochemical substrate for such research is becoming 
technologically ever more feasible, and it increases the generalizability 
of the findings. In particular, using a different medium for evolutionary 
studies can separate generic principles and ground truth from effects 
that are specific for carbon-based life as we know it.
Middle Pliocene hominin species diversity has been a subject of debate over the past two decades, particularly after the
naming of Australopithecus bahrelghazali and Kenyanthropus platyops in addition to the well-known species 
Australopithecus afarensis. Further analyses continue to support the proposal that several hominin species co-existed 
during this time period. Here we recognize a new hominin species (Australopithecus deyiremeda sp. nov.) from 
3.3-3.5-million-year-old deposits in the Woranso-Mille study area, central Afar, Ethiopia. The new species from 
Woranso-Mille shows that there were at least two contemporaneous hominin species living in the Afar region of Ethiopia
between 3.3 and 3.5 million years ago, and further confirms early hominin taxonomic diversity in eastern Africa during the 
Middle Pliocene epoch. The morphology of Au. deyiremeda also reinforces concerns related to dentognathic (that is, jaws 
and teeth) homoplasy in Plio-Pleistocene hominins, and shows that some dentognathic features traditionally associated 
with Paranthropus and Homo appeared in the fossil record earlier than previously thought. 


Recent fossil hominin discoveries from eastern and central Africa 
have been described as indicating the existence of multiple locomotor 
adaptations and taxonomic diversity during the Middle Pliocene.
Notably, Kenyanthropus platyops from Kenya and Australopithecus 
bahrelghazali from Chad were argued to be morphologically distinct 
from the contemporaneous Au. afarensis. Concerns have been
raised about the validity of these two taxa, either because identification
was based largely on a single specimen or because the type specimen
was heavily distorted. Yet further studies addressing these
concerns maintain the initial proposal of taxonomic diversity.
More recently, we described a 3.4-million-year (Myr)-old partial foot 
(BRT-VP-2/73) from the Burtele area of Woranso-Mille, Ethiopia.
The specimen is contemporaneous with Au. afarensis, but demon- 
strates the existence of a distinct mode of bipedal locomotion. The 
BRT-VP-2/73 foot combines a grasping hallux with lateral rays that 
exhibit adaptation to hyperdorsiflexion at the metatarsophalangeal 
joint, similar to the morphology described for Ardipithecus ramidus.
This specimen was not assigned to any taxon, pending the recovery of 
cranial and dentognathic fossils from the same geographic and stra- 
tigraphic vicinity. Continued fieldwork in the Burtele area has 
resulted in the recovery of new dentognathic fossil remains, including 
partial and complete mandibles, a partial maxilla with dentition, and 
a few isolated teeth (Table 1). Unfortunately, none of them is clearly
associated with BRT-VP-2/73 (see Supplementary Note 1 for details). 
The new remains described here show clear similarities with Austra- 
lopithecus and lack the diagnostic features of Ardipithecus, Paran- 
thropus, Kenyanthropus and Homo. Various aspects of dentognathic 
morphology distinguish these specimens from the contemporaneous 
Au. afarensis and warrant their assignment to a new species (Fig. 1). 


Order Primates Linnaeus, 1758 
Suborder Anthropoidea Mivart, 1864 
Superfamily Hominoidea Gray, 1825 

Genus Australopithecus Dart, 1925 
Australopithecus deyiremeda sp. nov. 


Etymology. From the local Afar language terms deyi, meaning close, 
and remeda, meaning relative; thus referring to the species being a 
close relative of all later hominins. 

Holotype. The holotype is BRT-VP-3/1 (Fig. 1a-e), a left maxilla with
upper second incisor-upper second molar (I²-M²) found by M. Barao
on 4 March 2011. The originals of the holotype and paratypes are 
housed at the Paleoanthropology Laboratory of the National Museum 
of Ethiopia, Addis Ababa. 

Paratypes. The paratypes are BRT-VP-3/14 (Fig. 1f-h), a complete 
mandible corpus with the apical halves of left lower incisor-right 
lower incisor (LI,-RI,) and crowns of right lower premolar-lower 
molar (RP3-M3), lacking the ascending rami; and WYT-VP-2/10 
(Fig. 1i-k), a right edentulous mandible. Both specimens were found 
by A. Asfaw. Detailed anatomical descriptions, additional images, and 
measurements are provided in Supplementary Note 2, Extended Data 
Figs 1 and 2, and Extended Data Tables 1, 2 and 5.

Referred specimen. BRT-VP-3/37 (Fig. 1l-p), a right maxillary fragment
with P⁴. This specimen was found only 5 m east of BRT-VP-3/14
and both specimens may belong to the same individual. However, this 
cannot be determined with confidence owing to the fragmentary 
nature of BRT-VP-3/37. 

Localities. The Burtele (BRT) and Waytaleyta (WYT) collection areas 
are located south of the perennial Mille River and north of the Mille- 
Chifra road. All localities are situated within an area of 3 km²
(Fig. 2). The coordinates of the holotype are 11° 27’ 43.9'' N and 
40° 31’ 41.0’’ E (GPS datum WGS84). 

Provenience. The holotype and paratypes from BRT-VP-3 are 
surface finds, weathered out of a sandstone at the bottom of a 6-m 
thick section of sandstone and siltstone capped by a 2-m thick basaltic 
flow. A tuff radiometrically dated to 3.469 ± 0.008 (mean ± s.d.) Myr
is located approximately 1 m above the basaltic flow. The paratype
from WYT-VP-2 was also a surface find, collected from the top of a
thin sandstone layer about 15 m above the 3.469-Myr-old tuff.
The combined evidence from radiometric, palaeomagnetic and
depositional rate analyses yields estimated minimum and maximum
ages of approximately 3.3 and 3.5 Myr, respectively (see Fig. 3, Extended
Data Fig. 3 and Supplementary Note 3 for details).

Table 1 | Fossil hominins from BRT and WYT localities

Specimen number  Preserved elements           Date discovered   Discoverer         Measurements
BRT-VP-1/1       Left M3 crown fragment       20 February 2006  K. Helem           N/A
BRT-VP-1/2       Left Py                      20 February 2006  K. Helem           10.4 (13.0)
BRT-VP-1/13      Right Cy                     20 February 2006  C. Mesfin          9.9, 9.3
BRT-VP-1/18      Left M2 mesial half          20 February 2006  K. Kayranto        N/A, 13.8
BRT-VP-1/118     Left M3 fragment             17 February 2012  G. Alemayehu       N/A
BRT-VP-1/120     Left Py                      17 February 2012  A. Elema           9.9, 10.1
BRT-VP-2/10      Frontal fragment             20 February 2006  Y. Haile-Selassie  N/A
BRT-VP-2/73      Right partial foot           15 February 2009  S. Melillo         See ref. ?
BRT-VP-2/89      Right P⁴                     17 February 2012  H. Gebreyesus      9.6, 12.9
BRT-VP-2/104     Molar fragment               26 February 2013  M. Mekomeli        N/A
BRT-VP-3/1       Left maxilla (I²-M²)         4 March 2011      M. Barao           LI², 6.9, 6.6; LC, (8.3), (9.4); LP³, 7.7, 12.0; LP⁴, 7.6, 11.8; LM¹, 10.6, 11.0; LM², 11.8, 13.2
BRT-VP-3/14      Mandible with dentition      4 March 2011      A. Asfaw           LI1, 5.4, 8.0; LI2, 6.5, 9.0; LC, 8.0, 10.7; LP3, 7.6, 10.8; LP4, 8.2, 10.6; LM1, (12.6), (12.6); LM2, (14.8), (13.8); LM3, (15.8), (13.4)
BRT-VP-3/37      Right maxilla fragment (P⁴)  1 March 2013      A. Asfaw           8.2 (12.1)
WYT-VP-2/10      Partial edentulous mandible  5 March 2011      A. Asfaw           N/A

Dental measurements are standard mesiodistal and buccolingual/labiolingual dimensions. Values in parentheses are corrected for breakage or interproximal attrition. All measurements are in millimetres and were taken on the original specimens by the authors. N/A, non-dental specimens and fragments with no measurement.

Differential diagnosis. Australopithecus deyiremeda differs from Ar. 
ramidus by having relatively and absolutely thicker enamel on its 
molars; a P, with three-roots; a robust mandible. It differs from Au. 
anamensis in lacking an extremely receding mandibular symphysis; 
having a more robust mandibular corpus and a bicuspid P3. It is 
distinguished from Au. afarensis by its overall mandibular architec- 
ture, including the lack of a lateral corpus hollow; low and anteriorly 
positioned origin of the ascending ramus with pronounced takeoff at
the P4/M1 level. It also differs in features of the maxilla, including an
anteriorly positioned zygomatic origin and in dental size and mor- 
phology, specifically an absolutely small upper postcanine dentition 
and an upper canine that is mesiodistally short and lacks lingual relief. 
Australopithecus deyiremeda differs from Paranthropus in lacking 
molarized premolars and reduced incisors. It is distinguished from 
K. platyops by its transversely convex subnasal region, with incisor 
alveoli considerably anterior to the bi-canine line; I’ root larger and 
more rounded than I’ root; more prognathic nasoalveolar clivus and 


Figure 1 | Holotype BRT-VP-3/1. a, Occlusal view. b, Lateral view.
c, Superior view. d, Medial view. e, Anterior view. Paratype BRT-VP-3/14.
f, Occlusal view. g, Right lateral view. h, Basal view. Paratype
WYT-VP-2/10. i, Occlusal view. j, Right lateral view. k, Basal view.
Referred specimen BRT-VP-3/37. l, Occlusal view. m, Buccal view.
n, Lingual view. o, Distal view. p, Mesial view.

Figure 2 | Location map for the Burtele collection area and the BRT-VP-1, 
BRT-VP-2, BRT-VP-3 and WYT-VP-2 localities. a, Aerial photograph 
showing the location of the collection area in relation to nearby roads and 
rivers. b, Geological map of the study area showing the outcrop areas of 
sedimentary and volcanic units and the locations of fossil sites, palaeomagnetic 
samples (with number of samples indicated in parentheses), the dated Burtele 
tuff, and location of the holotype and paratypes of Australopithecus deyiremeda. 
The UTM (Universal Transverse Mercator) zone is 37P. The aerial photograph 
in a is reproduced with permission from the Ethiopian Mapping Agency. 


F root obliquely implanted, indicating some degree of procumbency; 
prominent upper canine and Pp? jugae; presence of incipient canine 
fossa and weakly expressed zygomaticoalveolar crest. Australo- 
pithecus deyiremeda can be distinguished from Au. garhi by its 
reduced subnasal prognathism and absolutely smaller canines and 
postcanine dentition. It differs from early Homo by its less incisiform 
canines, asymmetric P₃, anterior margin of the mandibular ramus
arising more anteriorly (opposite distal P₄), mandibular corpus height
tapering posteriorly from P₃ to M₃, and by its anterosuperiorly opening
mental foramen.


Comparisons 
The holotype maxilla (BRT-VP-3/1) has a more anteriorly positioned 
zygomatic root compared to Au. anamensis and most of the known 
Au. afarensis maxillae. However, it overlaps with most early hominins 
in the degree of subnasal prognathism (as determined by the subnasal 
angle) and palate depth*** (see Extended Data Table 2). Although the 
position of prosthion on BRT-VP-3/1 cannot be determined with 
certainty, the maximum estimated subnasal angle of this specimen 
is ~39°, falling within the range of most early hominins but considerably
less than the angle reported for KNM-WT 40000, the holotype
of K. platyops (47°)*. 

The incisive foramen in BRT-VP-3/1 is positioned at the distal bi- 
canine line, whereas in Au. afarensis maxillae of comparable size (for 
Figure 3 | Palaeomagnetic directions for 16 samples collected from the
composite stratigraphic section at Burtele. Directions are shown as the
angular distance (delta) to the expected normal direction (declination 000°,
inclination 22°) for the given latitude. The stratigraphic positions of the
fossil localities, palaeomagnetic samples and carbonate samples are indicated.
All palaeomagnetic samples exhibit normal remanence directions and fall
within a single normal magnetozone. K-feldspar phenocrysts from a tuff ~1 m
above the basalt have been dated by the ⁴⁰Ar/³⁹Ar method, yielding a mean
age of 3.469 ± 0.008 Myr, indicating that this 36-m section corresponds to the
normal magnetozone C2An.3n (3.596-3.330 Ma). A minimum sedimentation
accumulation rate of 18.6 cm per 1,000 years is calculated for 26 m of strata
deposited in the time period <3.33-3.47 Ma. Applying this rate, the fossil
locality BRT-VP-3, situated 7 m stratigraphically below the tuff (excluding the
2 m basalt, see Supplementary Note 3), has an estimated age of 3.50-3.47 Myr.
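
The age estimate in the Figure 3 caption follows from simple arithmetic on the quoted thicknesses and ages. The sketch below reproduces it under the assumption of a constant accumulation rate between the tuff and the top of chron C2An.3n; the function and constant names are illustrative and are not taken from the paper.

```python
# Values quoted in the Figure 3 caption.
TUFF_AGE_MYR = 3.469        # mean 40Ar/39Ar age of the dated tuff
INTERVAL_OLD_MYR = 3.47     # older bound of the depositional interval
INTERVAL_YOUNG_MYR = 3.33   # younger bound (top of chron C2An.3n)
STRATA_THICKNESS_M = 26.0   # strata deposited within that interval
DEPTH_BELOW_TUFF_M = 7.0    # BRT-VP-3 below the tuff, basalt excluded


def accumulation_rate_cm_per_kyr(thickness_m, older_myr, younger_myr):
    """Thickness divided by duration, in cm per 1,000 years."""
    duration_kyr = (older_myr - younger_myr) * 1000.0
    return thickness_m * 100.0 / duration_kyr


def age_below_marker_myr(marker_age_myr, depth_m, rate_cm_per_kyr):
    """Age of a level lying depth_m below a dated marker bed."""
    elapsed_kyr = depth_m * 100.0 / rate_cm_per_kyr
    return marker_age_myr + elapsed_kyr / 1000.0


rate = accumulation_rate_cm_per_kyr(
    STRATA_THICKNESS_M, INTERVAL_OLD_MYR, INTERVAL_YOUNG_MYR
)
max_age = age_below_marker_myr(TUFF_AGE_MYR, DEPTH_BELOW_TUFF_M, rate)

print(f"minimum accumulation rate: {rate:.1f} cm per kyr")
print(f"maximum age of BRT-VP-3:   {max_age:.2f} Myr")
```

Because the rate is a minimum, the age obtained for the fossil level is a maximum; the tuff age itself supplies the younger bound of the quoted range.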


example, A.L. 199-1), and early Homo specimens such as A.L. 666-1,
the foramen is positioned posterior to this line, or at the P⁴ level®. The
palate anterior to the incisive foramen is also flexed inferiorly (see
Extended Data Fig. 1i, k, l), a feature considered as derived although it
is reported to be variable in Au. afarensis’. BRT-VP-3/1 shows very
little sagittal overlap between the hard palate and the nasoalveolar 
clivus (3.5 mm) unlike most Au. afarensis maxillae (see Extended 
Data Fig. 4). 

Further differences between Au. deyiremeda and Au. afarensis relate 
to the morphology of the upper dentition. The cheek teeth of Au. 
afarensis tend to exhibit roughly vertical buccal and lingual walls’®, 
whereas the buccal and lingual walls of the Au. deyiremeda premolars 
and molars converge occlusally, making the occlusal surface narrower 
than the base of the crown. The P³ and P⁴ of BRT-VP-3/1 have three
roots (Extended Data Fig. 1h, j). This is commonly seen in Paranthropus
species, but highly variable in Australopithecus. Kenyanthropus platyops
also has three-rooted upper premolars although P³-P⁴ crown
morphology is currently unknown for this species’.

The canine crown of BRT-VP-3/1 is absolutely smaller than 
almost all known Australopithecus upper canines, specifically in its 
and P. boisei (Extended Data Table 2). It also appears to be long-rooted
relative to its crown height, as in Au. anamensis'*”’.
Furthermore, the canine of BRT-VP-3/1 does not have pronounced 
lingual relief and entirely lacks the prominent mesial and distal ver- 
tical lingual grooves commonly seen in Au. afarensis upper canines’”. 
Its distal tubercle is also small and the mesial crown shoulder rela- 
tively short compared to the crown height and length of the mesial 
crest, thus it is less incisiform than Au. afarensis upper canines. It 
retains a more primitive, Au. anamensis-like condition in this regard. 

The postcanine teeth of BRT-VP-3/1, particularly the P⁴ and M¹,
are very small (Fig. 4; see Extended Data Table 2 for summary statistics).
The M¹ area of BRT-VP-3/1 is the smallest of all known
Pliocene hominins (excluding Ar. ramidus), whereas the P⁴ size is only
approximated by A.L. 199-1, the smallest outlier in the Au. afarensis
hypodigm. The M² also falls at the lower end of the Au. afarensis
range and is slightly larger than that of KNM-WT 40000, the holotype
of K. platyops. The P⁴ to M¹ area ratio of BRT-VP-3/1 falls within the
range of most Pliocene hominin taxa, but outside the range of
Paranthropus. In terms of enamel thickness, the two-dimensional
linear enamel thickness for the M¹ of BRT-VP-3/1 is similar to that
of other early hominins such as Au. afarensis, P. robustus and
Au. africanus. However, the two-dimensional and three-dimensional
enamel thickness values for the M² of this specimen are high, most
similar to P. robustus (see Supplementary Note 4 and Extended Data
Tables 3 and 4 for details).
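
The size comparisons in this paragraph use the square root of crown area (mesiodistal × buccolingual), the quantity plotted in Fig. 4. As a minimal sketch, the upper molar values listed for BRT-VP-3/1 in Extended Data Table 2 can be recovered from the raw dimensions in Table 1; the function name is illustrative only.

```python
import math


def sqrt_crown_area(mesiodistal_mm, buccolingual_mm):
    """Square root of crown area (MD x BL), in millimetres."""
    return math.sqrt(mesiodistal_mm * buccolingual_mm)


# BRT-VP-3/1 upper molar dimensions as listed in Table 1 (MD, BL).
m1 = sqrt_crown_area(10.6, 11.0)
m2 = sqrt_crown_area(11.8, 13.2)

print(f"sqrt M1 area: {m1:.1f} mm")
print(f"sqrt M2 area: {m2:.1f} mm")
```

Rounded to one decimal place, these agree with the 10.8 and 12.5 reported for the specimen in Extended Data Table 2.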

Although Au. deyiremeda and K. platyops show some similarities in 
maxillary morphology and molar size, they can be distinguished by a 
suite of qualitative characters related to the morphology of the nasoal- 
veolar clivus, shape of the dental arcade, and morphology of the 
anterior maxillary dentition. The nasoalveolar clivus in K. platyops 
is flat both sagittally and transversely’, whereas in Au. deyiremeda it is 
anteriorly convex in the transverse plane. In accordance with this 
difference, Au. deyiremeda and K. platyops differ in the shape of the 
anterior dental arcade. On the basis of the position of the preserved I¹
root, the transversely oblique orientation of the I² crown, and its
placement relative to the canine and postcanine row, it is clear that
the canine and incisors of BRT-VP-3/1 formed an arc with the
incisor alveoli well anterior to the bi-canine line, as in Au. afarensis
specimens such as A.L. 200-1a and A.L. 444-2 (ref. 9; see Extended
Data Fig. 1f, g). In K. platyops, the incisors are arranged parallel to 
the bi-canine line as in Paranthropus and Homo rudolfensis°. 
Kenyanthropus platyops also has a derived low and curved zygoma- 
ticoalveolar crest’, whereas this feature is not sufficiently developed to 
form a visible crest in BRT-VP-3/1. Instead, the lateral alveolar wall 
blends smoothly with the base of the zygomatic bone. Furthermore, 
computed tomography scans show that the incisor roots of BRT-VP- 
3/1 are straight and inclined posterosuperiorly, indicating procumb- 
ent incisors (Extended Data Fig. 1i, k). By contrast, the incisor roots of 
K. platyops do not show any sign of procumbency’. On the basis of 
root cross-section approximately 2 mm below the buccal cervicoenamel
line of I², the I¹ root of BRT-VP-3/1 is also rounded and more
Figure 4 | Box-and-whisker diagrams
comparing dental dimensions of the holotype
BRT-VP-3/1 with other early hominins. Square
roots of P⁴-M² areas (buccolingual × mesiodistal
dimensions) are plotted. The horizontal edges of
each box indicate the first quartile (Q₁), the median
(Q₂), and the third quartile (Q₃). The superiormost
and inferiormost horizontal lines show the
maximum and minimum values, respectively,
excluding outliers, which are shown above or below
robust than the mesiolaterally compressed I² root, unlike KNM-WT
40000 in which the I¹ and I² incisor roots are almost equal in size
(see Extended Data Fig. 1m). In general, K. platyops seems to be more 
derived than Au. deyiremeda in most of its maxillary and dental 
morphology. 

The overall morphology of the paratype mandibles (BRT-VP-3/14 
and WYT-VP-2/10; Fig. 1 and Extended Data Fig. 2) is comparable to 
that of eastern African robust taxa, although their overall size is within 
the range of Au. afarensis. The anterior origin of the ascending ramus 
at the posterior P₄ level is comparable to P. boisei mandibles such as 
the Peninj mandible and KGA-10-525 (ref. 14), and P. robustus speci- 
mens such as SK-12. Australopithecus afarensis mandibles have the 
origin of the ascending ramus high and more posterior, mostly distal 
to the first molar’*. BRT-VP-3/14 and WYT-VP-2/10 have inflated 
mandibular corpora, especially at the M,_; level, and they lack the 
lateral corpus hollow ubiquitous in Au. afarensis mandibles’. Corpus 
dimensions of BRT-VP-3/14 and WYT-VP-2/10 are notable 
because they are broader for their size than specimens assigned to 
Au. afarensis or early Homo. In this respect, they are more similar to 
Paranthropus (Fig. 5 and Supplementary Note 5). Mandibular speci- 
mens from Lomekwi whose taxonomic affinities have not been estab- 
lished with certainty (KNM-WT 8556 and KNM-WT 16006)*"’, are 
described as having robust corpora with an ascending ramus take-off 
at P₄ and M₁, respectively. However, the robusticity of the Lomekwi
specimens is not as pronounced as it is in WYT-VP-2/10 and BRT-VP-3/14.
Furthermore, the KNM-WT 8556 premolars are derived:
the P₃ has a well-developed metaconid and the P₄ is large and molarized.
By contrast, the P₃ of BRT-VP-3/14 is almost unicuspid, with
only a trace of the metaconid, and the P₄ is not molarized.
Figure 5 | Bivariate plots of mandibular corpus height and breadth among
early hominins. a, Dimensions at the M₁ level. b, Dimensions at the M₂ level.
The 95% confidence ellipses are shown for Au. afarensis, P. boisei and early
Homo (see Supplementary Note 5 for details). The Au. deyiremeda corpora are
relatively broad, comparable to Paranthropus. WYT-VP-2/10 falls outside the
Au. afarensis ellipse at the M₁ level. BRT-VP-3/14 (represented by the mean of
left and right sides) falls in an area of overlap at the M₁ level but at the edge of
the Au. afarensis ellipse at the M₂ level. Mandibular measurements for the
comparative taxa were taken from the references listed in Fig. 4 and ref. 14.
Figure 6 | Cladogram depicting the majority-rule consensus of 17 equally 
parsimonious trees that result from a phylogenetic analysis of the features 
preserved in the Au. deyiremeda hypodigm. Tree length = 63, consistency 
index = 0.63. The position of Au. garhi and topologies within the Paranthropus 
+ Kenyanthropus + Homo clade are unstable when these features are 
considered. However, Au. deyiremeda is consistently placed as a sister taxon 
to a clade that includes Au. africanus, Paranthropus, and Homo (see 
Supplementary Notes 6-8 for further details). 


The symphysis of BRT-VP-3/14 is thinner, superoinferiorly longer, 
and has a more receding external contour than that of WYT-VP-2/10 
(Extended Data Fig. 2h, n). In the latter specimen, the superior half of 
the external contour is almost vertical but the inferior half is extremely 
receding similar to L.H.-4, the holotype of Au. afarensis”'”'*. Further 
similarities between WYT-VP-2/10 and L.H.-4 include the prom- 
inence of the superior and inferior transverse tori (hence the 
deep genioglossal fossa) and the posterior extension of the inferior 
transverse torus exceeding that of the superior transverse torus. BRT- 
VP-3/14 has weak transverse tori and a very shallow genioglossal 
fossa (similar to Au. afarensis specimens such as A.L. 333w-12 and 
A.L. 315-22) and a pronounced postincisive planum (comparable 
to A.L. 400-1). These differences are within the range of variation 
seen in Au. afarensis mandibles’ and further show intra-taxon vari- 
ability in mandibular symphyseal morphology. Although it was prev- 
iously argued that two Au. bahrelghazali mandibles (KT 12 and 
KT 40) have symphyseal morphology that distinguishes them from 
Au. afarensis mandibles’, they appear to also fall within the range of 
Au. afarensis variation”’*”. 

In terms of the dentition, BRT-VP-3/14 has small premolars com- 
pared to Au. afarensis (see Extended Data Table 5 and Supplementary 
Note 5 for details). The small mandibular premolars appear to conform
well with the small upper premolars and M¹ of BRT-VP-3/1.
Despite the robust appearance of the mandible, the BRT-VP-3/1 
maxilla is only slightly smaller when occluded with the BRT-VP-3/14 
mandible (see Extended Data Fig. 5). 

The 2.8-Myr-old early Homo mandible (LD 350-1) recently 
reported from Ledi-Geraru”’ is similar to Au. deyiremeda and some 
Au. afarensis specimens in its anterior corpus morphology. However, 
the derived features that are present in LD 350-1 and diagnostic of 
Homo are absent in the Au. deyiremeda mandibles (for example, 
posteriorly opening mental foramen, comparable anterior and posterior
corpus height, M₃ mesiodistally shorter than M₂). Features of
Au. deyiremeda that are derived relative to Au. afarensis and earlier
hominins (for example, relatively robust mandibular corpus, anteriorly
positioned root of the ascending ramus) are absent in
LD 350-1 and other early Homo mandibles. Dentally, while the P₄
of BRT-VP-3/14 and LD 350-1 are comparable in size, the M₂ and M₃
of BRT-VP-3/14 are much larger (Extended Data Table 5).


Phylogenetic implications 

The taxonomy and phylogenetic relationships among early hominins 
are becoming more complicated as new taxa are added to the Pliocene 
fossil record**”*” and the temporal range and systematics of early 
Homo are reconsidered*’**. A phylogenetic analysis of the preserved 
morphological features of Au. deyiremeda fails to produce a single 
most-parsimonious cladogram. However, the results are fairly 
consistent in placing this species as a sister taxon to a clade including 
Au. africanus, Paranthropus and Homo (Fig. 6 and Supplementary 
Notes 6-8). There are many alternative phylogenies consistent with 
this topology, but the fact that Au. deyiremeda is positioned outside 
the Paranthropus and Homo clade implies that some features assoc- 
iated with one or both of these taxa are homoplastic. Therefore, the 
addition of Au. deyiremeda reinforces questions about dentognathic 
homoplasy in later Pliocene and early Pleistocene hominins. 

There is now incontrovertible evidence to show that multiple homi- 
nins existed contemporaneously in eastern Africa during the Middle 
Pliocene’’. Importantly, Woranso-Mille is geographically very close 
to Hadar (only 35 km north), where Au. afarensis is well documented. 
In combination, this suggests that multiple hominin species over- 
lapped temporally and lived in close geographic proximity. What 
remains intriguing, and requires further investigation, is how these 
taxa are related to each other and to later hominins, and what envir- 
onmental and ecological factors triggered such diversity. This has 
important implications for resource use and niche partitioning in 
Pliocene hominin palaeoecology, which can only be understood with 
the recovery and analysis of more hominin fossils and their associated 
fauna from Woranso-Mille and other contemporaneous sites. 
Extended Data Figure 1 | Computed-tomography-based visualization of
BRT-VP-3/1. a, Lateral view. b, Medial view. c, Anterior view. d, Palatal view.
e, Superior view. f, Palatal view mirror-imaged on midline. g, Superior view
mirror-imaged on midline. Computed-tomography-based cross-sections of
BRT-VP-3/1. h, Sagittal cross-section along the centre of the dental row
showing root morphology. i, Sagittal cross-section at I¹. j, Transverse
cross-section along the roots showing the number of roots of each tooth.
k, Sagittal cross-section at I². l, Midsagittal cross-section showing the shape
of the palatine process and nasoalveolar clivus. m, Transverse cross-section
across the incisors and the canine.
Extended Data Figure 2 | Computed-tomography-based visualization of
BRT-VP-3/14. a, Right lateral view. b, Left lateral view. c, Sagittal cross-section
along the centre of the right dental row showing root morphology. d, Sagittal
cross-section along the centre of the left dental row showing root morphology.
e, Occlusal view. f, Transverse cross-section along the roots showing the
number of roots of each tooth. Note that the premolars have several roots.
g, Basal view. h, Symphyseal cross-section. Computed-tomography-based
visualization of WYT-VP-2/10. i, Right lateral view. j, Medial view. k, Occlusal
view. l, Basal view. m, Sagittal cross-section along the centre of the dental row
showing root morphology. n, Symphyseal cross-section. o, Transverse
cross-section along the roots showing the number of roots of each tooth.
Extended Data Figure 3 | Magnetostratigraphy. a, Magnetostratigraphy for
the Burtele area, measured in five subsections beginning with the oldest strata in
the southwest (BRT-VP-3), to the youngest at BRT-VP-2 (see Fig. 2 for
subsection locations). This stratigraphy consists predominantly of claystones/
siltstones alternating with sandstones, and includes a 20-cm carbonate bed, a
2-6-m basalt flow, and a 5-cm tuff bed. The sandstones occupy slightly sinuous
fluvial channels with a low width/depth ratio in a weakly confined setting. The
sandstone beds are continuous and interstratified with pedogenically modified
fine-grained overbank and ephemeral lake deposits. The proportion of
sandstones increases upwards in the section, where it is accompanied by
the occurrence of celestine nodules indicating increased aridity. Sixteen
palaeomagnetic samples were obtained from fine-grained lithologies. All
samples demonstrate normal polarity (black bars), interpreted as belonging to a
single polarity chron C2An.3n (3.596-3.330 Myr old) of the Astronomically
Tuned Neogene Time Scale (ATNTS2004). b, Orthogonal demagnetization
diagrams showing vector endpoints after alternating field and thermal
demagnetization for samples MB13.1Aa and FS12.2b. Horizontal projections
are shown as squares, vertical projections as diamonds, and the NRM (4 mT)
starting point as a star. c, Palaeomagnetic directions for the Burtele stratigraphy
from samples collected along different stratigraphic horizons. Stereographic
projection referenced to magnetic North (declination 2° E), with solid symbols
on lower hemisphere (positive inclination). Represented directions are the mean
directions for each sample from three specimens. The star shows the location
of the expected normal direction for this latitude (000/22).
Figure key and annotations: lateral displacement; section orientation WNW-ESE;
carbonate sample; Pmag. sample; carbonate; silt; clay; sand; basalt; tuff; recent
alluvial deposits; ET12,13 sample direction end-of-streak; n=13, decl=350,
incl=12, a95=7.2, k=32.
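
Figure 3 plots each palaeomagnetic direction as its angular distance (delta) from the expected normal direction. A minimal sketch of that calculation, using the standard great-circle formula for two directions given as declination and inclination, is shown below. The function name is illustrative, and the example direction (declination 350°, inclination 12°) is read from the annotation in this figure rather than from a data table.

```python
import math


def angular_distance_deg(decl1, incl1, decl2, incl2):
    """Angle in degrees between two directions on the unit sphere.

    Each direction is given as (declination, inclination) in degrees.
    """
    d1, i1, d2, i2 = (math.radians(v) for v in (decl1, incl1, decl2, incl2))
    cos_delta = (
        math.sin(i1) * math.sin(i2)
        + math.cos(i1) * math.cos(i2) * math.cos(d1 - d2)
    )
    # Clamp to guard against rounding just outside [-1, 1].
    cos_delta = max(-1.0, min(1.0, cos_delta))
    return math.degrees(math.acos(cos_delta))


EXPECTED = (0.0, 22.0)    # expected normal direction (000/22)
ANNOTATED = (350.0, 12.0)  # direction annotated in the figure

delta = angular_distance_deg(*ANNOTATED, *EXPECTED)
print(f"delta = {delta:.1f} degrees")
```

A direction identical to the expected one gives a delta of zero, and a fully reversed direction gives 180°, which is why small deltas throughout a section indicate a single normal magnetozone.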
Extended Data Figure 4 | Comparisons of midsagittal cross section of the
hard palate and nasoalveolar clivus. a, BRT-VP-3/1. b, A.L. 444-2. c, A.L. 200-1.
d, A.L. 486-1. e, A.L. 427-1. f, A.L. 199-1. g, A.L. 442-1. Note that the overlap
between the hard palate and the nasoalveolar clivus of BRT-VP-3/1 is small
relative to most Au. afarensis specimens. Midsagittal cross-sections of Au.
afarensis maxillae were modified from Fig. 5.22 in ref. 9.
Extended Data Figure 5 | BRT-VP-3/1 (reversed) and BRT-VP-3/14 shown
in occlusion. The upper canine is aligned mesial to the P₃. Despite the
apparently large size of the mandible (BRT-VP-3/14), the maxilla (BRT-VP-3/1)
is only slightly smaller. The position of the second molars is indicated to
show that the canine-to-second-molar length is comparable in both specimens.
Extended Data Table 1 | Measurements of the paratype mandibles


Measurement BRT-VP-3/14 WYT-VP-2/10

1 Symphyseal height 42.2 40.7
2 Maximum symphyseal depth 19.9 23.3
3 Horizontal position of mental foramen P₃/P₄ P₃
4 Mental foramen height from base 19.4 18
5 Mental foramen height from alveolar margin ~17.4 ~16.6
6 Corpus height at P₄ 36.6, 35.8 35.7
7 Corpus width at P₄ 21.1, 21.8 23.8
8 Corpus height at P₄/M₁ 36.3, 35.9 34.8
9 Corpus width at P₄/M₁ 22.8, 22.7 27.11
10 Corpus height at M₁ 33.8, 33.0 34.3
11 Corpus width at M₁ 24.5, 23.6 26.2
12 Corpus height at M₁/M₂ 33.3, 31.9 -
13 Corpus width at M₁/M₂ 25.3, 28.0 -
14 Corpus height at M₂ 32.4, 30.5 -
15 Corpus width at M₂ 28.2, 29 -
16 Corpus height at M₂/M₃ 30.8, 28.5 -
17 Corpus width at M₂/M₃ 28.1, 29.5 -
18 P₄-P₄ distance at mid-crown on lingual side 29.9 -
19 M,-M, distance at mid-crown on lingual side ~33.5 - 


Corpus dimensions of BRT-VP-3/14 are reported for both right and left sides. Measurements 1-5 were taken following the methods used in ref. 25. Measurements 6-19 were taken following the methods used in 
ref. 17. All measurements are in millimetres. 
Extended Data Table 2 | Statistical summary of measurements of early hominin maxillae and upper dentition

Taxon            Statistic  Subnasal angle (°)  Palate depth  C¹ MD      √P⁴ area     √M¹ area     √M² area     P⁴/M¹ ratio
BRT-VP-3/1                  ~39                 ~13.4         8.3        9.1          10.8         12.5         0.84
Au. anamensis    n          1                   -             2          4            8            10           4
                 mean       27                  -             10.8       10.6         12.5         13.7         0.90
                 range      -                   -             9.9-11.7   9.7-12.1     11.0-13.1    12.3-15.5    0.79-0.92
                 s.d.       -                   -             -          1.0          0.74         1.02         0.06
Au. afarensis    n          6                   6             10         15           11           12           11
                 mean       34.6                11.3          9.9        10.6         12.9         13.9         0.81
                 range      29-39               8.5-14        8.9-11.6   9.2-12.5     11.4-14.2    12.3-14.9    0.74-0.97
                 s.d.       3.5                 1.8           0.74       0.72         0.88         0.73         0.07
K. platyops      n=1        47                  -             -          -            -            11.9         -
Au. garhi        n=1        27                  -             11.6       -            -            -            -
Au. africanus    n          9                   2             5          9            12           14           4
                 mean       34.2                16.3          9.6        11.3         13.2         14.6         0.87
                 range      30-37               14.5-18.0     8.8-10.0   10.8-12.4    12.4-14.4    13.3-16.4    0.84-0.89
                 s.d.       1.9                 -             0.48       0.50         0.50         0.95         0.02
early Homo       n          -                   -             7          12           18           17           9
                 mean       -                   -             9.4        10.5         12.7         13.3         0.81
                 range      -                   -             8.3-11.5   9.7-12.1     11.4-14.5    12.5-16.0    0.77-0.88
                 s.d.       -                   -             1.35       0.68         0.84         0.96         0.03
P. aethiopicus   n          2                   1             -          -            -            -            -
                 mean       34                  15            -          -            -            -            -
                 range      31-37               -             -          -            -            -            -
                 s.d.       -                   -             -          -            -            -            -
P. robustus      n          8                   5             13         18           19           16           7
                 mean       36.8                13.8          8.5        12.7         14.1         15.2         0.89
                 range      32-39               12.2-15.5     7.6-9.2    11.4-14.0    13.3-15.3    14.0-16.3    0.88-0.97
                 s.d.       4.1                 1.4           0.43       0.65         0.43         0.64         0.04
P. boisei        n          2                   3             8          6            12           11           6
                 mean       35.9                21            8.6        14.0         15.3         16.4         0.93
                 range      33-39               20-22         6.5-10.8   13.5-14.6    13.3-16.4    15.6-19.0    0.89-0.94
                 s.d.       -                   1.0           1.18       0.54         1.02         1.43         0.02


Canine comparative measurements are taken from refs 13, 22, 23 and 26. Subnasal angle comparative measurements are taken from summary data compiled in ref. 4. Palate depth comparative measurements are from refs 3 and 27. All other data for the comparative taxa were compiled from refs 3, 23, 25, 28-33. Subnasal angle is measured as the subnasal clivus (nasospinale-prosthion) to the postcanine alveolar margin. Palate depth is measured at M¹/M² for all specimens, except for P. aethiopicus (EP 1500/01), which was measured at P³/P⁴ (but appears to retain a constant depth posteriorly).
Extended Data Table 3 | Enamel thickness measurements


MicroCT scan settings
Fossil        Energy settings    Field of view (mm)  x, y pixel size (mm)  z slice thickness (mm)
BRT-VP-3/1    180 kV, 0.250 mA   43.01               0.042                 0.046
BRT-VP-3/14   280 kV, 2.0 mA     434.69              0.129                 0.129
WYT-VP-2/10   290 kV, 1.0 mA     77.86               0.076                 0.074

BRT-VP-3/1 Two-dimensional enamel thickness
Tooth  Cusp tip thickness (mm)  Maximum occlusal thickness (mm)  Maximum lateral thickness (mm)  Crown area (mm²)
       Buccal     Lingual       Buccal     Lingual               Buccal     Lingual
M¹     too worn   too worn      1.11       1.03                  1.2        1.38                 11.03
M²     1.76       1.93          1.53       1.66                  1.71       2.15                 12.36

BRT-VP-3/1 Three-dimensional enamel thickness
Tooth  Enamel volume (mm³)  Dentine area (mm²)  Dentine volume (mm³)  AET (average enamel thickness)  AETSTD (AET standardized)
M² 3D  325.357              186.478             273.224               1.745                           26.89


MicroCT scan settings, two-dimensional and three-dimensional enamel thickness measurements for BRT-VP-3/1.
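
The three-dimensional thickness values in this table are internally consistent with the definitions commonly used for these indices: average enamel thickness (AET) as enamel volume divided by the surface area of the enamel-dentine junction, and the standardized (relative) value as AET divided by the cube root of dentine volume, multiplied by 100. The sketch below applies those definitions; it assumes that the column headed "Dentine area" is the enamel-dentine junction surface area, and the function names are illustrative.

```python
def average_enamel_thickness(enamel_volume_mm3, edj_area_mm2):
    """3D AET in mm: enamel volume over EDJ surface area."""
    return enamel_volume_mm3 / edj_area_mm2


def relative_enamel_thickness(aet_mm, dentine_volume_mm3):
    """3D RET (dimensionless): AET scaled by the cube root of dentine volume."""
    return 100.0 * aet_mm / dentine_volume_mm3 ** (1.0 / 3.0)


# BRT-VP-3/1 M2 values from the three-dimensional block of the table.
aet = average_enamel_thickness(325.357, 186.478)
ret = relative_enamel_thickness(aet, 273.224)

print(f"AET = {aet:.3f} mm")
print(f"RET = {ret:.2f}")
```

With the tabulated inputs this yields values matching the 1.745 and 26.89 reported for the specimen, which supports reading the standardized column as a relative enamel thickness.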
Extended Data Table 4 | Three-dimensional and two-dimensional average and relative enamel thicknesses

Rows: 3D sample; enamel volume (mm³); dentine volume (mm³); EDJ surface area (mm²); 3D AET (mm); 3D RET; 2D sample; enamel area (mm²); dentine area (mm²); EDJ length (mm); 2D AET (mm); 2D RET

Values by column, in printed order:
BRT-VP-3/1 M²: 186.48, 273.22, 1.74, 28.83, 17.38, 1.42, 34.17
Homo sapiens: 39, 218.91, 226.83, 162.56, 1.43, 23.97, 257, 24.19, 20.06
P. robustus: 507.6, 311.8, 1.83, 53.23, 21.87, 2.03, 28.38
A. africanus: 1.63, 22.79
H. neanderthalensis: 53.8, 1.08, 15.55, 42, 21.97, 41.65, 20.75, 1.06, 16.44
Pongo pygmaeus: 12, 197.91, 336.05, 199.75, 1.01, 14.49, A, 23.42, 50.93, 21.34, 4:1, 15.49
Pan troglodytes: 26, 137, 166.05, 40, 14.63, 36.95, 19.47, 0.75, 13.23
Gorilla gorilla: 372.01, 1023.18, 375.89, 0.98, 9.77, 79.29, 28.25, 1.04, 11.68

The values for the BRT-VP-3/1 M² are compared with extinct hominins and extant hominoids. Data for fossil hominins and hominoids taken from ref. 34. AET, average enamel thickness; EDJ, enamel-dentine junction; RET, relative enamel thickness.
Extended Data Table 5 | Mandibular dental dimensions (P₃-M₃) and robusticity indices (at the M₁ and M₂ levels) of Au. deyiremeda and other
Plio-Pleistocene hominins


Columns: P₃ MD, P₃ BL, P₄ MD, P₄ BL, M₁ MD, M₁ BL, M₂ MD, M₂ BL, M₃ MD, M₃ BL, robusticity index at M₁, robusticity index at M₂
BRT-VP-3/14 n=1 7.6 10.8 8.2 10.6 (12.6) (12.6) (14.8) (13.8) (15.8) (13.4) 0.72 0.91 
WYT-VP-2/10 n=1 - - - - - - - - - - 0.76 - 
KNM-WT 8556 n=1 9.9 12.2 11.4 12.8 13.6 13.1 - (15.0) (19.0) - (0.59) - 
LD 350-1 n=1 - - 8.7 10.5 (12.2) - 13.2 12.5 13.0 12.2 (0.61) (0.69) 
Au. anamensis n 6 5 6 16 8 8 iti if i 7 2 - 
mean 9.9 10.9 9.0 10.6 12.6 11.7 14.3 13.5 15.0 13.2 0.53 - 
range 9.3 - 10.9 9.5 - 12.0 7.4 - 9.8 9.6 - 11.9 11.6 - 13.7 10.2 - 13.3 13.0 - 15.9 12.3 - 14.9 13.7 - 17.0 12.1 - 13.7 0.51 - 0.54 - 
s.d 0.60 1.00 0.95 0.84 0.87 1.10 0.92 0.88 1.17 0.66 - - 
Au. afarensis n 29 29 26 22 31 24 33 31 26 23 22 19 
mean 9.5 10.7 9.8 11.0 13.1 12.6 14.3 13.5 15.2 13.4 0.58 0.7 
range 7.9-12.6 8.9- 13.6 7.7-11.4 9.8-11.8 10.1 - 14.8 11.0- 13.9 12.1 - 16.5 11.1 - 15.2 13.4 - 18.1 11.3 - 15.3 0.48 - 0.76 0.56 - 0.88 
s.d 1.00 1.10 1.00 0.84 0.90 0.80 1.20 1.00 1.30 1.00 0.07 0.08 
Au. africanus n 18 17 24 20 32 30 37 40 34 31 4 5 
mean 9.7 11.4 10.4 11.6 13.9 13.2 15.7 14.4 16.1 14.5 0.61 0.76 
range 8.4-11.2 9.2 -13.9 9.3 - 12.3 10.3 - 13.4 12.4-15.8 10.8 - 15.1 14.0-17.8 12.7 - 16.8 12.9-18.5 12.1 - 16.8 0.56 - 0.65 0.64 - 0.97 
s.d 0.74 TAT 0.76 0.90 1.07 0.90 0.98 0.99 1.10 1.08 0.04 0.13 
Early Homo n 12 11 9 9 15 16 14 15 12 12 22 16 
mean 10.0 10.4 10.5 10.9 14.1 12.2 15.5 13.6 15.4 13.0 0.63 0.67 
range 8.9-11.1 8.0- 12.3 9.1-11.8 8.1 - 12.7 13.0 - 15.6 10.6 - 14.1 13.9 - 18.3 11.7-15.4 14.0-17.5 11.7 - 14.7 0.51 - 0.74 0.54 - 0.79 
s.d 0.71 1.36 0.90 1.42 0.89 0.94 1.28 1.07 0.96 0.94 0.06 0.07 
P. robustus n 20 19 A 16 24 18 20 20 22 21 5 5 
mean 10.2 11.6 11.5 13.1 14.7 13.7 16.3 14.7 aba 14.5 0.66 0.76 
range 9.0 - 11.4 9.0 - 13.7 10.6 - 12.6 11.5 - 14.8 13.2 - 16.5 11.8 - 15.0 14.8 - 17.9 12.8 - 16.3 15.1 - 20.5 12.6 - 17.0 0.55 - 0.75 0.62 - 0.83 
s.d 0.61 1.14 0.62 1.11 Ont, 0.85 0.90 0.84 1.38 1.14 0.07 0.08 
P. boisei n 5 5 9 9 10 6 8 6 9 12 25 23 
mean 11.1 13.0 13.5 14.3 16.5 15.5 18.2 16.9 16.4 20.0 0.69 0.76 
range 8.9 - 13.0 11.4-13.7 10.1 - 15.6 12.3 - 16.5 15.4 - 18.6 14.4-17.6 16.4 - 20.0 15.8 - 18.6 14.7-19.2 17.6 - 22.4 0.57 - 0.80 0.63 - 0.86 
s.d 1.66 0.93 1.76 1.28 0.97 1.17 1.33 1.11 1.56 ileal 0.05 0.06 


The values reported for BRT-VP-3/14 are averages of the left and right sides. The data for LD 350-1 are taken from ref. 21 and corrected for interproximal wear. Summary statistics for the comparative taxa were taken from ref. 15 and the references listed in Extended Data Table 2. Values in parentheses are estimates or corrected for breakage and/or interproximal attrition.
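
The robusticity indices for the two paratype mandibles can be checked against the corpus dimensions in Extended Data Table 1. The sketch below assumes the index is corpus width divided by corpus height, averaged over the sides available, and that rows 10-11 and 14-15 of that table are the M₁ and M₂ levels; neither assumption is stated explicitly in the tables, and the function name is illustrative.

```python
def robusticity_index(widths_mm, heights_mm):
    """Mean of width/height ratios over the sides measured."""
    ratios = [w / h for w, h in zip(widths_mm, heights_mm)]
    return sum(ratios) / len(ratios)


# BRT-VP-3/14: two sides reported per level (width, then height).
brt_m1 = robusticity_index([24.5, 23.6], [33.8, 33.0])
brt_m2 = robusticity_index([28.2, 29.0], [32.4, 30.5])

# WYT-VP-2/10: a single value at the M1 level.
wyt_m1 = robusticity_index([26.2], [34.3])

print(f"BRT-VP-3/14 at M1: {brt_m1:.2f}")
print(f"BRT-VP-3/14 at M2: {brt_m2:.2f}")
print(f"WYT-VP-2/10 at M1: {wyt_m1:.2f}")
```

Under these assumptions the results round to the 0.72, 0.91 and 0.76 listed for the two specimens, all above the Au. afarensis means in the same columns.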
Middle Pliocene hominin species diversity has been a subject of debate over the past two decades, particularly after the 
naming of Australopithecus bahrelghazali and Kenyanthropus platyops in addition to the well-known species 
Australopithecus afarensis. Further analyses continue to support the proposal that several hominin species co-existed 
during this time period. Here we recognize a new hominin species (Australopithecus deyiremeda sp. nov.) from 
3.3-3.5-million-year-old deposits in the Woranso-Mille study area, central Afar, Ethiopia. The new species from 
Woranso-Mille shows that there were at least two contemporaneous hominin species living in the Afar region of Ethiopia 
between 3.3 and 3.5 million years ago, and further confirms early hominin taxonomic diversity in eastern Africa during the 
Middle Pliocene epoch. The morphology of Au. deyiremeda also reinforces concerns related to dentognathic (that is, jaws 
and teeth) homoplasy in Plio-Pleistocene hominins, and shows that some dentognathic features traditionally associated 
with Paranthropus and Homo appeared in the fossil record earlier than previously thought. 


Recent fossil hominin discoveries from eastern and central Africa
have been described as indicating the existence of multiple locomotor
adaptations and taxonomic diversity during the Middle Pliocene.
Notably, Kenyanthropus platyops from Kenya and Australopithecus
bahrelghazali from Chad were argued to be morphologically distinct
from the contemporaneous Au. afarensis. Concerns have been
raised about the validity of these two taxa, either because identification
was based largely on a single specimen or because the type specimen
was heavily distorted. Yet further studies addressing these
concerns maintain the initial proposal of taxonomic diversity.
More recently, we described a 3.4-million-year (Myr)-old partial foot
(BRT-VP-2/73) from the Burtele area of Woranso-Mille, Ethiopia.
The specimen is contemporaneous with Au. afarensis, but demonstrates
the existence of a distinct mode of bipedal locomotion. The
BRT-VP-2/73 foot combines a grasping hallux with lateral rays that
exhibit adaptation to hyperdorsiflexion at the metatarsophalangeal
joint, similar to the morphology described for Ardipithecus ramidus.
This specimen was not assigned to any taxon, pending the recovery of
cranial and dentognathic fossils from the same geographic and stratigraphic
vicinity. Continued fieldwork in the Burtele area has
resulted in the recovery of new dentognathic fossil remains, including
partial and complete mandibles, a partial maxilla with dentition, and
a few isolated teeth (Table 1). Unfortunately, none of them is clearly
associated with BRT-VP-2/73 (see Supplementary Note 1 for details).
The new remains described here show clear similarities with Australopithecus
and lack the diagnostic features of Ardipithecus, Paranthropus,
Kenyanthropus and Homo. Various aspects of dentognathic
morphology distinguish these specimens from the contemporaneous
Au. afarensis and warrant their assignment to a new species (Fig. 1).


Order Primates Linnaeus, 1758 
Suborder Anthropoidea Mivart, 1864 
Superfamily Hominoidea Gray, 1825 

Genus Australopithecus Dart, 1925 
Australopithecus deyiremeda sp. nov. 


Etymology. From the local Afar language terms deyi, meaning close, 
and remeda, meaning relative; thus referring to the species being a 
close relative of all later hominins. 

Holotype. The holotype is BRT-VP-3/1 (Fig. 1a–e), a left maxilla with
upper second incisor–upper second molar (I²–M²) found by M. Barao
on 4 March 2011. The originals of the holotype and paratypes are 
housed at the Paleoanthropology Laboratory of the National Museum 
of Ethiopia, Addis Ababa. 

Paratypes. The paratypes are BRT-VP-3/14 (Fig. 1f-h), a complete 
mandible corpus with the apical halves of left lower incisor-right 
lower incisor (LI,-RI,) and crowns of right lower premolar-lower 
molar (RP3-M3), lacking the ascending rami; and WYT-VP-2/10 
(Fig. 1i-k), a right edentulous mandible. Both specimens were found 
by A. Asfaw. Detailed anatomical descriptions, additional images, and 
measurements are provided in Supplementary Note 2, Extended Data 
Figs 1 and 2, and Extended Data Tables 1, 2 and 5.

Referred specimen. BRT-VP-3/37 (Fig. 1l–p), a right maxillary fragment
with P⁴. This specimen was found only 5 m east of BRT-VP-3/14
and both specimens may belong to the same individual. However, this 
cannot be determined with confidence owing to the fragmentary 
nature of BRT-VP-3/37. 

Localities. The Burtele (BRT) and Waytaleyta (WYT) collection areas 
are located south of the perennial Mille River and north of the
Mille-Chifra road. All localities are situated within an area of 3 km²
(Fig. 2). The coordinates of the holotype are 11° 27′ 43.9″ N and
40° 31′ 41.0″ E (GPS datum WGS84).

Provenience. The holotype and paratypes from BRT-VP-3 are 
surface finds, weathered out of a sandstone at the bottom of a 6-m 
thick section of sandstone and siltstone capped by a 2-m thick basaltic 
flow. A tuff radiometrically dated to 3.469 ± 0.008 (mean ± s.d.) Myr
is located approximately 1 m above the basaltic flow. The paratype
from WYT-VP-2 was also a surface find, collected from the top of a
thin sandstone layer about 15 m above the 3.469-Myr-old tuff.
The combined evidence from radiometric, palaeomagnetic and 


1Cleveland Museum of Natural History, Cleveland, Ohio 44106, USA. 2Case Western Reserve University, Cleveland, Ohio 44106, USA. 3University of Barcelona, Marti Franqués s/n, 08028 Barcelona, Spain. 
4Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, D-04103 Leipzig, Germany. 5Pennsylvania State University, University Park, Pennsylvania 16802, USA. 6Addis Ababa University, PO 
Box 1176, Addis Ababa, Ethiopia. 7Berkeley Geochronology Center, 2455 Ridge Road, Berkeley, California 94709, USA. 8Johns Hopkins University, Baltimore, Maryland 21218, USA. 
Table 1 | Fossil hominins from BRT and WYT localities

Specimen number | Preserved elements | Date discovered | Discoverer | Measurements
BRT-VP-1/1 | Left M3 crown fragment | 20 February 2006 | K. Helem | N/A
BRT-VP-1/2 | Left lower premolar | 20 February 2006 | K. Helem | 10.4 (13.0)
BRT-VP-1/13 | Right lower canine | 20 February 2006 | C. Mesfin | 9.9, 9.3
BRT-VP-1/18 | Left M2 mesial half | 20 February 2006 | K. Kayranto | N/A, 13.8
BRT-VP-1/118 | Left M3 fragment | 17 February 2012 | G. Alemayehu | N/A
BRT-VP-1/120 | Left lower premolar | 17 February 2012 | A. Elema | 9.9, 10.1
BRT-VP-2/10 | Frontal fragment | 20 February 2006 | Y. Haile-Selassie | N/A
BRT-VP-2/73 | Right partial foot | 15 February 2009 | S. Melillo | See ref. ?
BRT-VP-2/89 | Right P⁴ | 17 February 2012 | H. Gebreyesus | 9.6, 12.9
BRT-VP-2/104 | Molar fragment | 26 February 2013 | M. Mekomeli | N/A
BRT-VP-3/1 | Left maxilla (I²–M²) | 4 March 2011 | M. Barao | LI², 6.9, 6.6; LC, (8.3), (9.4); LP³, 7.7, 12.0; LP⁴, 7.6, 11.8; LM¹, 10.6, 11.0; LM², 11.8, 13.2
BRT-VP-3/14 | Mandible with dentition | 4 March 2011 | A. Asfaw | LI₁, 5.4, 8.0; LI₂, 6.5, 9.0; LC, 8.0, 10.7; LP₃, 7.6, 10.8; LP₄, 8.2, 10.6; LM₁, (12.6), (12.6); LM₂, (14.8), (13.8); LM₃, (15.8), (13.4)
BRT-VP-3/37 | Right maxilla fragment (P⁴) | 1 March 2013 | A. Asfaw | 8.2 (12.1)
WYT-VP-2/10 | Partial edentulous mandible | 5 March 2011 | A. Asfaw | N/A

Dental measurements are standard mesiodistal and buccolingual/labiolingual dimensions. Values in parentheses are corrected for breakage or interproximal attrition. All measurements are in millimetres and were taken on the original specimens by the authors. N/A, non-dental specimens and fragments with no measurement.



depositional rate analyses yields estimated minimum and maximum 
ages of approximately 3.3 and 3.5 Myr, respectively (Fig. 3, Extended 
Data Fig. 3 and Supplementary Note 3 for details). 

Differential diagnosis. Australopithecus deyiremeda differs from Ar. 
ramidus by having relatively and absolutely thicker enamel on its 
molars; a P, with three-roots; a robust mandible. It differs from Au. 
anamensis in lacking an extremely receding mandibular symphysis; 
having a more robust mandibular corpus and a bicuspid P3. It is 
distinguished from Au. afarensis by its overall mandibular architecture,
including the lack of a lateral corpus hollow and a low and anteriorly
positioned origin of the ascending ramus with pronounced takeoff at
the P4/M1 level. It also differs in features of the maxilla, including an
anteriorly positioned zygomatic origin, and in dental size and morphology,
specifically an absolutely small upper postcanine dentition
and an upper canine that is mesiodistally short and lacks lingual relief.
Australopithecus deyiremeda differs from Paranthropus in lacking
molarized premolars and reduced incisors. It is distinguished from
K. platyops by its transversely convex subnasal region, with incisor
alveoli considerably anterior to the bi-canine line; I¹ root larger and
more rounded than I² root; more prognathic nasoalveolar clivus and


Figure 1 | Holotype BRT-VP-3/1. a, Occlusal view. b, Lateral view. c, Superior view. d, Medial view. e, Anterior view. Paratype BRT-VP-3/14. f, Occlusal view. g, Right lateral view. h, Basal view. Paratype WYT-VP-2/10. i, Occlusal view. j, Right lateral view. k, Basal view. Referred specimen BRT-VP-3/37. l, Occlusal view. m, Buccal view. n, Lingual view. o, Distal view. p, Mesial view.
Figure 2 | Location map for the Burtele collection area and the BRT-VP-1, 
BRT-VP-2, BRT-VP-3 and WYT-VP-2 localities. a, Aerial photograph 
showing the location of the collection area in relation to nearby roads and 
rivers. b, Geological map of the study area showing the outcrop areas of 
sedimentary and volcanic units and the locations of fossil sites, palaeomagnetic 
samples (with number of samples indicated in parentheses), the dated Burtele 
tuff, and location of the holotype and paratypes of Australopithecus deyiremeda. 
The UTM (Universal Transverse Mercator) zone is 37P. The aerial photograph 
in a is reproduced with permission from the Ethiopian Mapping Agency. 


incisor root obliquely implanted, indicating some degree of procumbency;
prominent upper canine and P³ jugae; presence of incipient canine
fossa and weakly expressed zygomaticoalveolar crest. Australopithecus
deyiremeda can be distinguished from Au. garhi by its
reduced subnasal prognathism and absolutely smaller canines and
postcanine dentition. It differs from early Homo by its less incisiform
canines, asymmetric P3, anterior margin of the mandibular ramus
arising more anteriorly (opposite distal P4), mandibular corpus height
tapering posteriorly from P3 to M3, and by its anterosuperiorly opening
mental foramen.


Comparisons 
The holotype maxilla (BRT-VP-3/1) has a more anteriorly positioned
zygomatic root than Au. anamensis and most of the known
Au. afarensis maxillae. However, it overlaps with most early hominins
in the degree of subnasal prognathism (as determined by the subnasal
angle) and palate depth (see Extended Data Table 2). Although the
position of prosthion on BRT-VP-3/1 cannot be determined with
certainty, the maximum estimated subnasal angle of this specimen
is ~39°, falling within the range of most early hominins but considerably
less than the angle reported for KNM-WT 40000, the holotype
of K. platyops (47°).
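
The range comparison in this paragraph can be checked against the subnasal angle ranges listed in Extended Data Table 2. The following is only an illustrative sketch: the dictionary and the function name `taxa_containing` are hypothetical, and taxa represented by a single value (Au. anamensis, Au. garhi, K. platyops) are left out because no range is reported for them.

```python
# Subnasal angle ranges (degrees) as listed in Extended Data Table 2.
# Only taxa with a reported range are included (an assumption of this sketch).
SUBNASAL_RANGES = {
    "Au. afarensis": (29, 39),
    "Au. africanus": (30, 37),
    "P. aethiopicus": (31, 37),
    "P. robustus": (32, 39),
    "P. boisei": (33, 39),
}

BRT_VP_3_1_MAX_ANGLE = 39  # maximum estimate for the holotype
KNM_WT_40000_ANGLE = 47    # holotype of K. platyops


def taxa_containing(angle, ranges=SUBNASAL_RANGES):
    """Return the taxa whose reported range includes the given angle."""
    return [taxon for taxon, (low, high) in ranges.items() if low <= angle <= high]


brt_matches = taxa_containing(BRT_VP_3_1_MAX_ANGLE)
kp_matches = taxa_containing(KNM_WT_40000_ANGLE)
print(brt_matches)
print(kp_matches)
```

Under these assumptions the maximum estimate for BRT-VP-3/1 sits at the upper limit of three of the five reported ranges, while the K. platyops value lies outside all of them.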

The incisive foramen in BRT-VP-3/1 is positioned at the distal bi- 
canine line, whereas in Au. afarensis maxillae of comparable size (for 
Figure 3 | Palaeomagnetic directions for 16 samples collected from the
composite stratigraphic section at Burtele. Directions are shown as the
angular distance (delta) to the expected normal direction (000°, 22) for the
given latitude. The stratigraphic position of the fossil localities, palaeomagnetic
samples and carbonate samples are indicated. All palaeomagnetic samples
exhibit normal remanence directions and fall within a single normal
magnetozone. K-feldspar phenocrysts from a tuff ~1 m above the basalt have
been dated by the ⁴⁰Ar/³⁹Ar method, yielding a mean age of
3.469 ± 0.008 Myr, indicating that this 36-m section corresponds to the
normal magnetozone C2An.3n (3.596–3.330 Ma). A minimum sediment
accumulation rate of 18.6 cm per 1,000 years is calculated for 26 m of strata
deposited in the time period <3.33–3.47 Ma. Applying this rate, the fossil
locality BRT-VP-3 situated 7 m stratigraphically below the tuff (excluding the
2-m basalt, see Supplementary Note 3) has an estimated age of 3.50–3.47 Myr.
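
The age estimate in the Figure 3 caption follows from simple arithmetic, which can be sketched as below. This is a minimal illustration rather than the authors' procedure (which is described in Supplementary Note 3); the function names are hypothetical, and the inputs are the rounded values quoted in the caption.

```python
def accumulation_rate_cm_per_kyr(thickness_m, older_ma, younger_ma):
    """Sediment accumulation rate in cm per 1,000 years."""
    duration_kyr = (older_ma - younger_ma) * 1000.0
    return thickness_m * 100.0 / duration_kyr


def age_below_marker_ma(marker_age_ma, depth_below_m, rate_cm_per_kyr):
    """Age (Ma) of a level lying a given thickness below a dated marker bed."""
    elapsed_kyr = depth_below_m * 100.0 / rate_cm_per_kyr
    return marker_age_ma + elapsed_kyr / 1000.0


# 26 m of strata deposited between about 3.47 Ma and (at most) 3.33 Ma.
rate = accumulation_rate_cm_per_kyr(26.0, 3.47, 3.33)

# BRT-VP-3 lies 7 m below the 3.469 Ma tuff (basalt thickness excluded).
oldest_age = age_below_marker_ma(3.469, 7.0, rate)

print(f"rate = {rate:.1f} cm/kyr, oldest age = {oldest_age:.3f} Ma")
```

Because the accumulation rate is a minimum, the time represented by the 7 m of sediment is a maximum, so the computed value (roughly 3.5 Ma) is an upper bound and the tuff age (about 3.47 Ma) is the lower bound.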


example, A.L. 199-1), and early Homo specimens such as A.L. 666-1,
the foramen is positioned posterior to this line, or at the P⁴ level. The
palate anterior to the incisive foramen is also flexed inferiorly (see
Extended Data Fig. 1i, k, l), a feature considered as derived although it
is reported to be variable in Au. afarensis. BRT-VP-3/1 shows very
little sagittal overlap between the hard palate and the nasoalveolar
clivus (3.5 mm) unlike most Au. afarensis maxillae (see Extended
Data Fig. 4).

Further differences between Au. deyiremeda and Au. afarensis relate
to the morphology of the upper dentition. The cheek teeth of Au.
afarensis tend to exhibit roughly vertical buccal and lingual walls,
whereas the buccal and lingual walls of the Au. deyiremeda premolars
and molars converge occlusally, making the occlusal surface narrower
than the base of the crown. The P³ and P⁴ of BRT-VP-3/1 have three
roots (Extended Data Fig. 1h, j). This is commonly seen in Paranthropus
species, but highly variable in Australopithecus. Kenyanthropus platyops
also has three-rooted upper premolars although P³–P⁴ crown
morphology is currently unknown for this species.

The canine crown of BRT-VP-3/1 is absolutely smaller than 
almost all known Australopithecus upper canines, specifically in its 


28 MAY 2015 | VOL 521 | NATURE | 485 


©2015 Macmillan Publishers Limited. All rights reserved 


mesiodistal dimension, which falls close to the mean of P. robustus
and P. boisei (Extended Data Table 2). It also appears to be long-rooted
relative to its crown height, as in Au. anamensis.
Furthermore, the canine of BRT-VP-3/1 does not have pronounced
lingual relief and entirely lacks the prominent mesial and distal vertical
lingual grooves commonly seen in Au. afarensis upper canines.
Its distal tubercle is also small and the mesial crown shoulder relatively
short compared to the crown height and length of the mesial
crest, thus it is less incisiform than Au. afarensis upper canines. It
retains a more primitive, Au. anamensis-like condition in this regard.

The postcanine teeth of BRT-VP-3/1, particularly the P⁴ and M¹,
are very small (Fig. 4; see Extended Data Table 2 for summary statistics).
The M¹ area of BRT-VP-3/1 is the smallest of all known
Pliocene hominins (less Ar. ramidus), whereas the P⁴ size is only
approximated by A.L. 199-1, the smallest outlier in the Au. afarensis
hypodigm. The M² also falls at the lower end of the Au. afarensis
range and is slightly larger than that of KNM-WT 40000, the holotype
of K. platyops. The P⁴ to M¹ area ratio of BRT-VP-3/1 falls within the
range of most Pliocene hominin taxa, but outside the range of
Paranthropus. In terms of enamel thickness, the two-dimensional
linear enamel thickness for the M¹ of BRT-VP-3/1 is similar to that
of other early hominins such as Au. afarensis, P. robustus and
Au. africanus. However, the two-dimensional and three-dimensional
enamel thickness values for the M² of this specimen are high, most
similar to P. robustus (see Supplementary Note 4 and Extended Data
Tables 3 and 4 for details).
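
The tooth-size values compared here are square roots of crown area (mesiodistal × buccolingual), as stated in the Figure 4 caption. The sketch below recomputes them for the two upper molars of BRT-VP-3/1 from the Table 1 measurements; the function name `sqrt_crown_area` and the dictionary layout are illustrative assumptions, not part of the original analysis.

```python
import math

# Mesiodistal and buccolingual crown dimensions (mm) of the BRT-VP-3/1
# upper molars, as listed in Table 1.
BRT_VP_3_1_MOLARS = {
    "M1": (10.6, 11.0),
    "M2": (11.8, 13.2),
}


def sqrt_crown_area(mesiodistal, buccolingual):
    """Square root of the rectangular crown area, in mm."""
    return math.sqrt(mesiodistal * buccolingual)


sizes = {
    tooth: round(sqrt_crown_area(md, bl), 1)
    for tooth, (md, bl) in BRT_VP_3_1_MOLARS.items()
}
print(sizes)
```

The rounded results, 10.8 for M¹ and 12.5 for M², agree with the BRT-VP-3/1 row of Extended Data Table 2.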

Although Au. deyiremeda and K. platyops show some similarities in
maxillary morphology and molar size, they can be distinguished by a
suite of qualitative characters related to the morphology of the nasoalveolar
clivus, shape of the dental arcade, and morphology of the
anterior maxillary dentition. The nasoalveolar clivus in K. platyops
is flat both sagittally and transversely, whereas in Au. deyiremeda it is
anteriorly convex in the transverse plane. In accordance with this
difference, Au. deyiremeda and K. platyops differ in the shape of the
anterior dental arcade. On the basis of the position of the preserved I¹
root, the transversely oblique orientation of the I² crown, and its
placement relative to the canine and postcanine row, it is clear that
the canine and incisors of BRT-VP-3/1 formed an arc with the
incisor alveoli well anterior to the bi-canine line, as in Au. afarensis
specimens such as A.L. 200-1a and A.L. 444-2 (ref. 9; see Extended
Data Fig. 1f, g). In K. platyops, the incisors are arranged parallel to
the bi-canine line as in Paranthropus and Homo rudolfensis.
Kenyanthropus platyops also has a derived low and curved zygomaticoalveolar
crest, whereas this feature is not sufficiently developed to
form a visible crest in BRT-VP-3/1. Instead, the lateral alveolar wall
blends smoothly with the base of the zygomatic bone. Furthermore,
computed tomography scans show that the incisor roots of BRT-VP-3/1
are straight and inclined posterosuperiorly, indicating procumbent
incisors (Extended Data Fig. 1i, k). By contrast, the incisor roots of
K. platyops do not show any sign of procumbency. On the basis of
root cross-section approximately 2 mm below the buccal cervicoenamel
line of I², the I¹ root of BRT-VP-3/1 is also rounded and more
Figure 4 | Box-and-whisker diagrams 
comparing dental dimensions of the holotype 
BRT-VP-3/1 with other early hominins. Square 
roots of P⁴–M² areas (buccolingual × mesiodistal 
dimensions) are plotted. The horizontal edges of 
each box indicate the first quartile (Q1), the median 
(Q2), and the third quartile (Q3). The superiormost 
and inferiormost horizontal lines show the 
maximum and minimum values, respectively, 
excluding outliers, which are shown above or below 
robust than the mesiolaterally compressed I² root, unlike KNM-WT
40000 in which the I¹ and I² incisor roots are almost equal in size
(see Extended Data Fig. 1m). In general, K. platyops seems to be more
derived than Au. deyiremeda in most of its maxillary and dental
morphology.

The overall morphology of the paratype mandibles (BRT-VP-3/14
and WYT-VP-2/10; Fig. 1 and Extended Data Fig. 2) is comparable to
that of eastern African robust taxa, although their overall size is within
the range of Au. afarensis. The anterior origin of the ascending ramus
at the posterior P4 level is comparable to P. boisei mandibles such as
the Peninj mandible and KGA-10-525 (ref. 14), and P. robustus specimens
such as SK-12. Australopithecus afarensis mandibles have the
origin of the ascending ramus high and more posterior, mostly distal
to the first molar. BRT-VP-3/14 and WYT-VP-2/10 have inflated
mandibular corpora, especially at the level of the molars, and they lack the
lateral corpus hollow ubiquitous in Au. afarensis mandibles. Corpus
dimensions of BRT-VP-3/14 and WYT-VP-2/10 are notable
because they are broader for their size than specimens assigned to
Au. afarensis or early Homo. In this respect, they are more similar to
Paranthropus (Fig. 5 and Supplementary Note 5). Mandibular specimens
from Lomekwi whose taxonomic affinities have not been established
with certainty (KNM-WT 8556 and KNM-WT 16006) are
described as having robust corpora with an ascending ramus take-off
at P4 and M1, respectively. However, the robusticity of the Lomekwi
specimens is not as pronounced as it is in WYT-VP-2/10 and
BRT-VP-3/14. Furthermore, the KNM-WT 8556 premolars are derived:
the P3 has a well-developed metaconid and the P4 is large and molarized.
By contrast, the P3 of BRT-VP-3/14 is almost unicuspid, with
only a trace of the metaconid, and the P4 is not molarized.
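
The statement that the corpora are broad for their size can be illustrated with a simple breadth-to-height index. The sketch below is an assumption-laden illustration: the index (100 × breadth / height) is a conventional summary chosen here rather than the bivariate analysis reported in Fig. 5, the function names are hypothetical, and the inputs are the corpus height and width values that Extended Data Table 1 appears to list for the first molar level (the tooth subscripts in that table are partly illegible in this copy).

```python
def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def robusticity_index(heights_mm, breadths_mm):
    """Corpus breadth as a percentage of corpus height, averaging sides."""
    return 100.0 * mean(breadths_mm) / mean(heights_mm)


# BRT-VP-3/14: both sides are reported, so they are averaged first.
brt_vp_3_14 = robusticity_index([33.8, 33.0], [24.5, 23.6])

# WYT-VP-2/10: a single side is preserved.
wyt_vp_2_10 = robusticity_index([34.3], [26.2])

print(round(brt_vp_3_14, 1), round(wyt_vp_2_10, 1))
```

With these inputs the index is about 72 for BRT-VP-3/14 and about 76 for WYT-VP-2/10. Comparative values for other taxa are not computed here, since the underlying measurements come from the references cited in Fig. 5.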



Figure 5 | Bivariate plots of mandibular corpus height and breadth among
early hominins. a, Dimensions at the M1 level. b, Dimensions at the M2 level.
The 95% confidence ellipses are shown for Au. afarensis, P. boisei and early
Homo (see Supplementary Note 5 for details). The Au. deyiremeda corpora are
relatively broad, comparable to Paranthropus. WYT-VP-2/10 falls outside the
Au. afarensis ellipse at the M1 level. BRT-VP-3/14 (represented by the mean of
left and right sides) falls in an area of overlap at the M1 level but at the edge of
the Au. afarensis ellipse at the M2 level. Mandibular measurements for the
comparative taxa were taken from the references listed in Fig. 4 and ref. 14.
Figure 6 | Cladogram depicting the majority-rule consensus of 17 equally 
parsimonious trees that result from a phylogenetic analysis of the features 
preserved in the Au. deyiremeda hypodigm. Tree length = 63, consistency 
index = 0.63. The position of Au. garhi and topologies within the Paranthropus 
+ Kenyanthropus + Homo clade are unstable when these features are 
considered. However, Au. deyiremeda is consistently placed as a sister taxon 
to a clade that includes Au. africanus, Paranthropus, and Homo (see 
Supplementary Notes 6-8 for further details). 


The symphysis of BRT-VP-3/14 is thinner, superoinferiorly longer,
and has a more receding external contour than that of WYT-VP-2/10
(Extended Data Fig. 2h, n). In the latter specimen, the superior half of
the external contour is almost vertical but the inferior half is extremely
receding, similar to L.H.-4, the holotype of Au. afarensis. Further
similarities between WYT-VP-2/10 and L.H.-4 include the prominence
of the superior and inferior transverse tori (hence the
deep genioglossal fossa) and the posterior extension of the inferior
transverse torus exceeding that of the superior transverse torus.
BRT-VP-3/14 has weak transverse tori and a very shallow genioglossal
fossa (similar to Au. afarensis specimens such as A.L. 333w-12 and
A.L. 315-22) and a pronounced postincisive planum (comparable
to A.L. 400-1). These differences are within the range of variation
seen in Au. afarensis mandibles and further show intra-taxon variability
in mandibular symphyseal morphology. Although it was previously
argued that two Au. bahrelghazali mandibles (KT 12 and
KT 40) have symphyseal morphology that distinguishes them from
Au. afarensis mandibles, they appear to also fall within the range of
Au. afarensis variation.

In terms of the dentition, BRT-VP-3/14 has small premolars com- 
pared to Au. afarensis (see Extended Data Table 5 and Supplementary 
Note 5 for details). The small mandibular premolars appear to conform
well with the small upper premolars and M¹ of BRT-VP-3/1.
Despite the robust appearance of the mandible, the BRT-VP-3/1 
maxilla is only slightly smaller when occluded with the BRT-VP-3/14 
mandible (see Extended Data Fig. 5). 

The 2.8-Myr-old early Homo mandible (LD 350-1) recently 
reported from Ledi-Geraru is similar to Au. deyiremeda and some 
Au. afarensis specimens in its anterior corpus morphology. However, 
the derived features that are present in LD 350-1 and diagnostic of 
Homo are absent in the Au. deyiremeda mandibles (for example, 
posteriorly opening mental foramen, comparable anterior and pos- 
terior corpus height, M3 mesiodistally shorter than M,). Features of 
Au. deyiremeda that are derived relative to Au. afarensis and earlier 
hominins (for example, relatively robust mandibular corpus, ante- 
riorly positioned root of the ascending ramus) are absent in 
LD 350-1 and other early Homo mandibles. Dentally, while the P4 
of BRT-VP-3/14 and LD 350-1 are comparable in size, the M, and M3 
of BRT-VP-3/14 are much larger (Extended Data Table 5). 


Phylogenetic implications 

The taxonomy and phylogenetic relationships among early hominins
are becoming more complicated as new taxa are added to the Pliocene
fossil record and the temporal range and systematics of early
Homo are reconsidered. A phylogenetic analysis of the preserved
morphological features of Au. deyiremeda fails to produce a single
most-parsimonious cladogram. However, the results are fairly
consistent in placing this species as a sister taxon to a clade including
Au. africanus, Paranthropus and Homo (Fig. 6 and Supplementary
Notes 6-8). There are many alternative phylogenies consistent with
this topology, but the fact that Au. deyiremeda is positioned outside
the Paranthropus and Homo clade implies that some features associated
with one or both of these taxa are homoplastic. Therefore, the
addition of Au. deyiremeda reinforces questions about dentognathic
homoplasy in later Pliocene and early Pleistocene hominins.

There is now incontrovertible evidence to show that multiple hominins
existed contemporaneously in eastern Africa during the Middle
Pliocene. Importantly, Woranso-Mille is geographically very close
to Hadar (only 35 km north), where Au. afarensis is well documented. 
In combination, this suggests that multiple hominin species over- 
lapped temporally and lived in close geographic proximity. What 
remains intriguing, and requires further investigation, is how these 
taxa are related to each other and to later hominins, and what envir- 
onmental and ecological factors triggered such diversity. This has 
important implications for resource use and niche partitioning in 
Pliocene hominin palaeoecology, which can only be understood with 
the recovery and analysis of more hominin fossils and their associated 
fauna from Woranso-Mille and other contemporaneous sites. 
Extended Data Figure 1 | Computed-tomography-based visualization of BRT-VP-3/1. a, Lateral view. b, Medial view. c, Anterior view. d, Palatal view. e, Superior view. f, Palatal view mirror-imaged on midline. g, Superior view mirror-imaged on midline. Computed-tomography-based cross-sections of BRT-VP-3/1. h, Sagittal cross-section along the centre of the dental row showing root morphology. i, Sagittal cross-section at I¹. j, Transverse cross-section along the roots showing the number of roots of each tooth. k, Sagittal cross-section at I². l, Midsagittal cross-section showing the shape of the palatine process and nasoalveolar clivus. m, Transverse cross-section across the incisors and the canine.
Extended Data Figure 2 | Computed-tomography-based visualization of BRT-VP-3/14. a, Right lateral view. b, Left lateral view. c, Sagittal cross-section along the centre of the right dental row showing root morphology. d, Sagittal cross-section along the centre of the left dental row showing root morphology. e, Occlusal view. f, Transverse cross-section along the roots showing the number of roots of each tooth. Note that the premolars have several roots. g, Basal view. h, Symphyseal cross-section. Computed-tomography-based visualization of WYT-VP-2/10. i, Right lateral view. j, Medial view. k, Occlusal view. l, Basal view. m, Sagittal cross-section along the centre of the dental row showing root morphology. n, Symphyseal cross-section. o, Transverse cross-section along the roots showing the number of roots of each tooth.


Extended Data Figure 3 | Magnetostratigraphy. a, Magnetostratigraphy for the Burtele area, measured in five subsections beginning with the oldest strata in the southwest (BRT-VP-3), to the youngest at BRT-VP-2 (see Fig. 2 for subsection locations). This stratigraphy consists predominantly of claystones/siltstones alternating with sandstones, and includes a 20-cm carbonate bed, a 2-6-m basalt flow, and a 5-cm tuff bed. The sandstones occupy slightly sinuous fluvial channels with a low width/depth ratio in a weakly confined setting. The sandstone beds are continuous and interstratified with pedogenically modified fine-grained overbank and ephemeral lake deposits. The proportion of sandstones increases upwards in the section, where it is accompanied by the occurrence of celestine nodules indicating increased aridity. Sixteen palaeomagnetic samples were obtained from fine-grained lithologies. All samples demonstrate normal polarity (black bars), interpreted as belonging to a single polarity chron C2An.3n (3.596–3.330 Myr old) of the Astronomically Tuned Neogene Time Scale (ATNTS2004). b, Orthogonal demagnetization diagrams showing vector endpoints after alternating field and thermal demagnetization for samples MB13.1Aa and FS12.2b. Horizontal projections are shown as squares, vertical projection as diamonds, and NRM (4 mT) starting point as star. c, Palaeomagnetic directions for the Burtele stratigraphy from samples collected along different stratigraphic horizons. Stereographic projection referenced to magnetic North (declination 2° E), with solid symbols on lower hemisphere (plus inclination). Represented directions are the mean directions for each sample from three specimens. The star shows the location of the expected normal direction for this latitude (000/22).

Figure legend entries: lateral displacement; carbonate sample; palaeomagnetic (Pmag.) sample; carbonate; silt; basalt; tuff; clay; sand; recent alluvial deposits. Statistics printed in panel c: n = 13, decl = 350, incl = 12, a95 = 7.2, k = 32.
Extended Data Figure 4 | Comparisons of midsagittal cross section of the hard palate and nasoalveolar clivus. a, BRT-VP-3/1. b, A.L. 444-2. c, A.L. 200-1. d, A.L. 486-1. e, A.L. 427-1. f, A.L. 199-1. g, A.L. 442-1. Note that the overlap between the hard palate and the nasoalveolar clivus of BRT-VP-3/1 is small relative to most Au. afarensis specimens. Midsagittal cross-sections of Au. afarensis maxillae were modified from Fig. 5.22 in ref. 9.
Extended Data Figure 5 | BRT-VP-3/1 (reversed) and BRT-VP-3/14 shown in occlusion. The upper canine is aligned mesial to the P3. Despite the apparently large size of the mandible (BRT-VP-3/14), the maxilla (BRT-VP-3/1) is only slightly smaller. The position of the second molars is indicated to show that the canine-to-second-molar length is comparable in both specimens.
Extended Data Table 1 | Measurements of the paratype mandibles

Measurement | BRT-VP-3/14 | WYT-VP-2/10
1 Symphyseal height | 42.2 | 40.7
2 Maximum symphyseal depth | 19.9 | 23.3
3 Horizontal position of mental foramen | P3/P4 | P3
4 Mental foramen height from base | 19.4 | 18
5 Mental foramen height from alveolar margin | ~17.4 | ~16.6
6 Corpus height at P4 | 36.6, 35.8 | 35.7
7 Corpus width at P4 | 21.1, 21.8 | 23.8
8 Corpus height at P4/M1 | 36.3, 35.9 | 34.8
9 Corpus width at P4/M1 | 22.8, 22.7 | 27.11
10 Corpus height at M1 | 33.8, 33.0 | 34.3
11 Corpus width at M1 | 24.5, 23.6 | 26.2
12 Corpus height at M1/M2 | 33.3, 31.9 | -
13 Corpus width at M1/M2 | 25.3, 28.0 | -
14 Corpus height at M2 | 32.4, 30.5 | -
15 Corpus width at M2 | 28.2, 29 | -
16 Corpus height at M2/M3 | 30.8, 28.5 | -
17 Corpus width at M2/M3 | 28.1, 29.5 | -
18 P4-P4 distance at mid-crown on lingual side | 29.9 | -
19 M,-M, distance at mid-crown on lingual side | ~33.5 | -

Corpus dimensions of BRT-VP-3/14 are reported for both right and left sides. Measurements 1-5 were taken following the methods used in ref. 25. Measurements 6-19 were taken following the methods used in ref. 17. All measurements are in millimetres.
Extended Data Table 2 | Statistical summary of measurements of early hominin maxillae and upper dentition

Taxon           Stat.   Subnasal angle  Palate depth  C MD       √P4 area   √M1 area   √M2 area   √P4/√M1
BRT-VP-3/1              ~39             ~13.4         8.3        9.1        10.8       12.5       0.84
Au. anamensis   n       1               -             2          4          8          10         4
                mean    27              -             10.8       10.6       12.5       13.7       0.90
                range   -               -             9.9-11.7   9.7-12.1   11.0-13.1  12.3-15.5  0.79-0.92
                s.d.    -               -             -          1.0        0.74       1.02       0.06
Au. afarensis   n       6               6             10         15         11         12         11
                mean    34.6            11.3          9.9        10.6       12.9       13.9       0.81
                range   29-39           8.5-14        8.9-11.6   9.2-12.5   11.4-14.2  12.3-14.9  0.74-0.97
                s.d.    3.5             1.8           0.74       0.72       0.88       0.73       0.07
K. platyops     n = 1   47              -             -          -          -          11.9       -
Au. garhi       n = 1   27              -             11.6       -          -          -          -
Au. africanus   n       9               2             5          9          12         14         4
                mean    34.2            16.3          9.6        11.3       13.2       14.6       0.87
                range   30-37           14.5-18.0     8.8-10.0   10.8-12.4  12.4-14.4  13.3-16.4  0.84-0.89
                s.d.    1.9             -             0.48       0.50       0.50       0.95       0.02
early Homo      n       -               -             7          12         18         17         9
                mean    -               -             9.4        10.5       12.7       13.3       0.81
                range   -               -             8.3-11.5   9.7-12.1   11.4-14.5  12.5-16.0  0.77-0.88
                s.d.    -               -             1.35       0.68       0.84       0.96       0.03
P. aethiopicus  n       2               1             -          -          -          -          -
                mean    34              15            -          -          -          -          -
                range   31-37           -             -          -          -          -          -
                s.d.    -               -             -          -          -          -          -
P. robustus     n       8               5             13         18         19         16         7
                mean    36.8            13.8          8.5        12.7       14.1       15.2       0.89
                range   32-39           12.2-15.5     7.6-9.2    11.4-14.0  13.3-15.3  14.0-16.3  0.88-0.97
                s.d.    4.1             1.4           0.43       0.65       0.43       0.64       0.04
P. boisei       n       2               3             8          6          12         11         6
                mean    35.9            21            8.6        14.0       15.3       16.4       0.93
                range   33-39           20-22         6.5-10.8   13.5-14.6  13.3-16.4  15.6-19.0  0.89-0.94
                s.d.    -               1.0           1.18       0.54       1.02       1.43       0.02


Canine comparative measurements are taken from refs 13, 22, 23 and 26. Subnasal angle comparative measurements are taken from summary data compiled in ref. 4. Palate depth comparative measurements
are from refs 3 and 27. All other data for the comparative taxa were compiled from refs 3, 23, 25, 28-33. Subnasal angle is measured as the subnasal clivus (nasospinale-prosthion) to the postcanine alveolar
margin. Palate depth is measured at M1/M2 for all specimens, except for P. aethiopicus (EP 1500/01), which was measured at P3/P4 (but appears to retain a constant depth posteriorly).
Extended Data Table 3 | Enamel thickness measurements

MicroCT scan settings

Fossil        Energy settings     Field of view (mm)   x, y pixel size (mm)   z slice thickness (mm)
BRT-VP-3/1    180 kV, 0.250 mA    43.01                0.042                  0.046
BRT-VP-3/14   280 kV, 2.0 mA      434.69               0.129                  0.129
WYT-VP-2/10   290 kV, 1.0 mA      77.86                0.076                  0.074

BRT-VP-3/1 Two-dimensional enamel thickness

       Cusp tip thickness (mm)   Maximum occlusal thickness (mm)   Maximum lateral thickness (mm)   Crown area (mm²)
       Buccal      Lingual       Buccal      Lingual               Buccal      Lingual
M1     too worn    too worn      1.11        1.03                  1.2         1.38                 11.03
M2     1.76        1.93          1.53        1.66                  1.71        2.15                 12.36

BRT-VP-3/1 Three-dimensional enamel thickness

         Enamel volume (mm³)   Dentine area (mm²)   Dentine volume (mm³)   AET (average enamel thickness, mm)   AETSTD (AET standardized)
M2 3D    325.357               186.478              273.224                1.745                                26.89

MicroCT scan settings, two-dimensional and three-dimensional enamel thickness measurements for BRT-VP-3/1.
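
The three-dimensional values in this table agree with the conventional definitions of enamel thickness, which the table itself does not spell out: average enamel thickness (AET) as enamel volume divided by the surface area of the enamel-dentine junction, and the standardized value as AET divided by the cube root of dentine volume, multiplied by 100. The sketch below assumes those definitions, and assumes that the column labelled "dentine area" is the enamel-dentine junction surface area; the function names are illustrative only.

```python
def average_enamel_thickness(enamel_volume_mm3, edj_area_mm2):
    """3D AET in mm: enamel volume over enamel-dentine junction area."""
    return enamel_volume_mm3 / edj_area_mm2


def standardized_enamel_thickness(aet_mm, dentine_volume_mm3):
    """Scale-free thickness: 100 * AET / cube root of dentine volume."""
    return 100.0 * aet_mm / dentine_volume_mm3 ** (1.0 / 3.0)


# Values reported for the BRT-VP-3/1 upper second molar.
aet = average_enamel_thickness(325.357, 186.478)
aet_std = standardized_enamel_thickness(aet, 273.224)
print(round(aet, 3), round(aet_std, 2))  # 1.745 26.89
```

Both results match the tabulated AET (1.745) and AETSTD (26.89), which supports reading the columns this way.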
Extended Data Table 4 | Three-dimensional and two-dimensional average and relative enamel thicknesses

Rows, in order: 3D sample; enamel volume (mm³); dentine volume (mm³); EDJ surface area (mm²); 3D AET (mm); 3D RET; 2D sample; enamel area (mm²); dentine area (mm²); EDJ length (mm); 2D AET (mm); 2D RET.

Values are listed per taxon in row order; cells missing from the extracted table are not shown.

BRT-VP-3/1 M2:        186.48; 273.22; 1.74; 28.83; 17.38; 1.42; 34.17
Homo sapiens:         39; 218.91; 226.83; 162.56; 1.43; 23.97; 257; 24.19; 20.06
P. robustus:          507.6; 311.8; 1.83; 53.23; 21.87; 2.03; 28.38
A. africanus:         1.63; 22.79
H. neanderthalensis:  53.8; 1.08; 15.55; 42; 21.97; 41.65; 20.75; 1.06; 16.44
Pongo pygmaeus:       12; 197.91; 336.05; 199.75; 1.01; 14.49; [illegible]; 23.42; 50.93; 21.34; 1.1; 15.49
Pan troglodytes:      26; 137; 166.05; 40; 14.63; 36.95; 19.47; 0.75; 13.23
Gorilla gorilla:      372.01; 1023.18; 375.89; 0.98; 9.77; 79.29; 28.25; 1.04; 11.68

The values for the BRT-VP-3/1 M2 are compared with extinct hominins and extant hominoids. Data for fossil hominins and hominoids taken from ref. 34. AET, average enamel thickness; EDJ, enamel-dentine
junction; RET, relative enamel thickness.
Extended Data Table 5 | Mandibular dental dimensions (P3-M3) and robusticity indices (at the M1 and M2 levels) of Au. deyiremeda and other
Plio-Pleistocene hominins


P3 P4 M1 M2 M3 Robusticity index at M1 Robusticity index at M2
MD BL MD BL MD BL MD BL MD BL
BRT-VP-3/14 n=1 7.6 10.8 8.2 10.6 (12.6) (12.6) (14.8) (13.8) (15.8) (13.4) 0.72 0.91 
WYT-VP-2/10 n=1 - - - - - - - - - - 0.76 - 
KNM-WT 8556 n= 1 9.9 12.2 11.4 12.8 13.6 13.1 - (15.0) (19.0) - (0.59) - 
LD 350-1 n=1 - - 8.7 10.5 (12.2) - 13.2 12.5 13.0 12.2 (0.61) (0.69) 
Au. anamensis n 6 5 6 16 8 8 iti if i 7 2 - 
mean 9.9 10.9 9.0 10.6 12.6 11.7 14.3 13.5 15.0 13.2 0.53 - 
range 9.3 - 10.9 9.5 - 12.0 7.4 - 9.8 9.6 - 11.9 11.6 - 13.7 10.2 - 13.3 13.0 - 15.9 12.3 - 14.9 13.7 - 17.0 12.1 - 13.7 0.51 - 0.54 - 
s.d 0.60 1.00 0.95 0.84 0.87 1.10 0.92 0.88 1.17 0.66 - - 
Au. afarensis n 29 29 26 22 31 24 33 31 26 23 22 19 
mean 9.5 10.7 9.8 11.0 13.1 12.6 14.3 13.5 15.2 13.4 0.58 0.7 
range 7.9-12.6 8.9- 13.6 7.7-11.4 9.8-11.8 10.1 - 14.8 11.0- 13.9 12.1 - 16.5 11.1 - 15.2 13.4 - 18.1 11.3 - 15.3 0.48 - 0.76 0.56 - 0.88 
s.d 1.00 1.10 1.00 0.84 0.90 0.80 1.20 1.00 1.30 1.00 0.07 0.08 
Au. africanus n 18 17 24 20 32 30 37 40 34 31 4 5 
mean 9.7 11.4 10.4 11.6 13.9 13.2 15.7 14.4 16.1 14.5 0.61 0.76 
range 8.4-11.2 9.2 -13.9 9.3 - 12.3 10.3 - 13.4 12.4-15.8 10.8 - 15.1 14.0-17.8 12.7 - 16.8 12.9-18.5 12.1 - 16.8 0.56 - 0.65 0.64 - 0.97 
s.d 0.74 TAT 0.76 0.90 1.07 0.90 0.98 0.99 1.10 1.08 0.04 0.13 
Early Homo n 12 11 9 9 15 16 14 15 12 12 22 16 
mean 10.0 10.4 10.5 10.9 14.1 12.2 15.5 13.6 15.4 13.0 0.63 0.67 
range 8.9-11.1 8.0- 12.3 9.1-11.8 8.1 - 12.7 13.0 - 15.6 10.6 - 14.1 13.9 - 18.3 11.7-15.4 14.0-17.5 11.7 - 14.7 0.51 - 0.74 0.54 - 0.79 
s.d 0.71 1.36 0.90 1.42 0.89 0.94 1.28 1.07 0.96 0.94 0.06 0.07 
P. robustus n 20 19 A 16 24 18 20 20 22 21 5 5 
mean 10.2 11.6 11.5 13.1 14.7 13.7 16.3 14.7 aba 14.5 0.66 0.76 
range 9.0 - 11.4 9.0 - 13.7 10.6 - 12.6 11.5 - 14.8 13.2 - 16.5 11.8 - 15.0 14.8 - 17.9 12.8 - 16.3 15.1 - 20.5 12.6 - 17.0 0.55 - 0.75 0.62 - 0.83 
s.d 0.61 1.14 0.62 1.11 Ont, 0.85 0.90 0.84 1.38 1.14 0.07 0.08 
P. boisei n 5 5 9 9 10 6 8 6 9 12 25 23 
mean 11.1 13.0 13.5 14.3 16.5 15.5 18.2 16.9 16.4 20.0 0.69 0.76 
range 8.9 - 13.0 11.4-13.7 10.1 - 15.6 12.3 - 16.5 15.4 - 18.6 14.4-17.6 16.4 - 20.0 15.8 - 18.6 14.7-19.2 17.6 - 22.4 0.57 - 0.80 0.63 - 0.86 
s.d 1.66 0.93 1.76 1.28 0.97 1.17 1.33 1.11 1.56 ileal 0.05 0.06 


The values reported for BRT-VP-3/14 are averages of the left and right sides. The data for LD 350-1 are taken from ref. 21 and corrected for interproximal wear. Summary statistics for the comparative taxa were
taken from ref. 15 and the references listed in Extended Data Table 2. Values in parentheses are estimates or corrected for breakage and/or interproximal attrition.
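
The robusticity indices listed for the two paratype mandibles can be reproduced from measurements 10, 11, 14 and 15 of Extended Data Table 1 if the index is taken as corpus width divided by corpus height at the same tooth position, with the left and right sides of BRT-VP-3/14 averaged first. That definition is an assumption inferred from the numerical agreement, not something the table states; the function name is illustrative.

```python
from statistics import mean


def robusticity_index(widths_mm, heights_mm):
    """Corpus width over corpus height, each averaged across the sides given."""
    return mean(widths_mm) / mean(heights_mm)


# BRT-VP-3/14: right and left corpus dimensions at M1 and at M2.
brt_m1 = robusticity_index([24.5, 23.6], [33.8, 33.0])
brt_m2 = robusticity_index([28.2, 29.0], [32.4, 30.5])
# WYT-VP-2/10: a single side is reported at M1.
wyt_m1 = robusticity_index([26.2], [34.3])

print(round(brt_m1, 2), round(brt_m2, 2), round(wyt_m1, 2))  # 0.72 0.91 0.76
```

These match the tabulated values of 0.72 and 0.91 for BRT-VP-3/14 and 0.76 for WYT-VP-2/10.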
Patients with high-grade serous ovarian cancer (HGSC) have experienced little improvement in overall survival, and
standard treatment has not advanced beyond platinum-based combination chemotherapy, during the past 30 years. To
understand the drivers of clinical phenotypes better, here we use whole-genome sequencing of tumour and germline
DNA samples from 92 patients with primary refractory, resistant, sensitive and matched acquired resistant disease. We
show that gene breakage commonly inactivates the tumour suppressors RB1, NF1, RAD51B and PTEN in HGSC, and
contributes to acquired chemotherapy resistance. CCNE1 amplification was common in primary resistant and refractory
disease. We observed several molecular events associated with acquired resistance, including multiple independent
reversions of germline BRCA1 or BRCA2 mutations in individual patients, loss of BRCA1 promoter methylation, an
alteration in molecular subtype, and recurrent promoter fusion associated with overexpression of the drug efflux
pump MDR1.


Recurrent, chemotherapy-resistant HGSC' accounts for most epithe- 
lial ovarian cancer deaths. Primary disease is characterized by ubi- 
quitous TP53 mutation”’, extensive copy number change*®, and in 
about half of all HGSCs, there is evidence of mutational? and func- 
tional® inactivation of homologous recombination repair. Extensive 
intratumoural heterogeneity in primary HGSCs has been recently 
documented, as it has in other solid cancers’. However, relatively 
little is known of the genomic evolution of HGSC under the selective 
pressure of chemotherapy. 

As HGSC is driven by genomic copy number change rather than 
recurrent point mutation’, we undertook the first whole-genome 
sequence (WGS) analysis of a large HGSC cohort to examine struc- 
tural variation at high resolution. We performed transcriptome, 
methylation, and microRNA (miRNA) expression analyses to sup- 
port the WGS data, and compared resistant, refractory and sensitive 


primary disease. We also evaluated paired primary sensitive and 
relapse-resistant samples, and end-stage HGSC obtained from 
patients who underwent rapid autopsy for research. 


Driver mutation by gene breakage 


We performed whole-genome sequencing (40× and 52× average coverage,
germline and tumour, respectively; Supplementary Table 1),
complemented by transcriptome sequencing (average 210 million reads
per sample; Supplementary Table 2), and methylation, copy number
and miRNA array analysis on a total of 114 tumour samples from 92
patients (Methods and Extended Data Fig. 1). Confirming previous
reports, TP53 mutations were prevalent (Fig. 1 and Supplementary
Table 3). Inactivating germline or somatic mutations in genes
associated with homologous recombination repair, or BRCA1 methylation,
were detected collectively in half of the primary tumours (Fig. 1 and
Supplementary Table 4). CCNE1 gain/amplification (19% of cases) was
largely exclusive of BRCA1/2 pathway disruption (Fig. 1).

A total of 36,561 somatic structural variants were detected in primary
and recurrence samples, ranging from 48 to 1,064 per tumour (Fig. 1a
and Supplementary Table 5). The frequency of detected structural
variant events (Extended Data Fig. 2a) or single nucleotide variants (SNVs)
(Extended Data Fig. 2b) was not related to tumour sample purity. We
observed cases with highly rearranged genomes, a low number of
scattered breakpoints or a high concentration of events involving one or
few chromosomes (Fig. 1a, b and Supplementary Table 6). Similar to
basal-like breast cancers, primary solid tumour samples with BRCA1
inactivation had more structural variants (P < 0.0001; Extended Data
Fig. 2c), involving numerous chromosomal regions (Fig. 1a-d).
Tumours with BRCA1 mutations also had a higher number of
genome-wide (P < 0.001) and coding (P < 0.001) small mutations
(SNVs, and insertions or deletions (indels)) compared to
homologous-recombination-intact samples (Extended Data Fig. 2d, e). BRCA1
mutation is frequently associated with a pronounced intratumoural
immune response, which may relate to the greater number of
mutations observed in these tumours. We identified one instance of BRCA2
disruption by chromothripsis, but this mutational mechanism was
infrequent, as was the occurrence of breakage-fusion-bridge cycles
(Fig. 1c, Extended Data Fig. 3 and Supplementary Information).

Figure 1 | Genomic features of HGSC. a, Structural variants (SVs) in 80
primary solid tumours and 12 ascites samples from 92 individual cases.
b, Highly rearranged genomes have at least 7 chromosomes containing
clustered breakpoint regions (blue squares), high density of breakpoints
(orange), both previous conditions (red) or high density of inter-chromosomal
translocations (purple). c, Genomes with fewer than 7 chromosomes with
clustered breakpoints show localized complex rearrangements
(chromothripsis, pink; breakage-fusion-bridge, pale pink; and amplification
linked by translocations, purple) or scattered breakpoints. HR, homologous
recombination. d, Significantly mutated driver genes (IntoGen q-value <0.01)
and homologous recombination repair genes disrupted by rearrangements.


Additional highly complex focal events were observed, including a
pattern of amplified loci linked by high-density inter-chromosomal
translocations similar to chromoplexy, and chromothripsis-associated
production of double minute chromosomes (Supplementary
Information). Chromosome 19 commonly contained clusters of
localized breakpoints, particularly in samples with amplification involving
19q12 and the CCNE1 locus (Fig. 1b, c).

Previous exome analyses concluded that apart from TP53, somatic
point mutations in driver genes are infrequent in primary HGSC, an
observation consistent with our findings (Fig. 1d and Supplementary
Table 7). However, as exome studies have a limited ability to detect
gene mutation by structural rearrangement, we considered
inactivation of genes by disruption of transcriptional units (gene breakage).
Although NF1 and RB1 were inactivated by truncating point
mutations and indels in only 6% of primary samples (5 out of 80), inclusion
of gene breakage raised the frequency of inactivating mutations to
20% for NF1 and 17.5% for RB1, an observation supported by
expression levels and transcriptome sequence data (Fig. 1d, Extended Data
Fig. 4 and Supplementary Table 8). Most involved breakage without
deletion, and therefore would not have been detected by copy number analysis.
Gene inactivation by breakage was also seen for PTEN and RAD51B
(Extended Data Fig. 4).
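
The effect of counting gene breakage can be checked with simple arithmetic on the 80 primary samples. In the sketch below only the 5-of-80 figure comes from the text; the sample counts of 16 and 14 are back-calculated from the reported 20% and 17.5% and are therefore assumptions, as are the variable names.

```python
PRIMARY_SAMPLES = 80


def percent(count, total=PRIMARY_SAMPLES):
    """Share of primary samples carrying an inactivating event, in percent."""
    return 100.0 * count / total


point_mutations_only = percent(5)   # 5 of 80, reported as 6%
nf1_with_breakage = percent(16)     # count implied by the reported 20% (assumed)
rb1_with_breakage = percent(14)     # count implied by the reported 17.5% (assumed)

print(point_mutations_only, nf1_with_breakage, rb1_with_breakage)  # 6.25 20.0 17.5
```

On these numbers, including breakage events roughly triples the apparent frequency of inactivation for each gene.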
We searched for gene fusion events and identified 822 genes involved
in intergenic fusion events potentially capable of producing a fused
transcript but without evidence of recurrent, biologically plausible
driver events in primary HGSC (Supplementary Information). We did not
detect any evidence supporting previously reported ESRRA and
C11orf20 (also known as TEX40), and CDKN2D and WDFY2 (ref.
20) fusions in our cohort (Supplementary Information).


Mutational signatures 


Analysis of mutations across cancer genomes can reveal distinct pat- 
terns of single nucleotide substitution, many of which are associated 
with carcinogen exposure or mutational processes operative during 
tumorigenesis’. For most samples, either the previously described” 
age- or BRCA-associated signatures were dominant (Fig. 2 and 
Extended Data Fig. 5), with a close relationship between the age 
signature, patient age at diagnosis (P < 0.0001; Extended Data Fig. 
6a) and CCNE1 gene amplification (P < 0.01; Extended Data Fig. 6b). 
In a small number of cases, the APOBEC or mismatch repair signa- 
tures were apparent (Extended Data Fig. 5 and Supplementary 
Information). The BRCA mutational signature was dominant in all
34 samples with germline or somatic inactivation of BRCA1 or BRCA2
(P < 0.0001); however, this association was less consistent in the small
number of patients with other homologous recombination pathway
mutations (Extended Data Fig. 6c). Similarly, primary sensitive cases
were found to show the highest BRCA signature contribution 
(Extended Data Fig. 6d). A group of 12 patients with a dominant 
BRCA signature, but no mutation in the homologous recombination 
pathway, was identified (Fig. 2). Eleven of these patients were either 
resistant or refractory to primary treatment, consistent with intact 
homologous recombination function. Therefore, although the 
BRCA signature sensitively detected germline or somatic BRCA1/2 
mutations, it could also be dominant in samples in which there was no 
evidence of mutational or clinical characteristics suggestive of a 
homologous recombination pathway defect. 
Clinical associations 


We compared patterns of structural variation, gene expression, miRNA 
and methylation in primary tumour samples in the context of response 
to primary chemotherapy. Consistent with our previous findings”, 
amplification of 19q12 involving CCNE1 was the dominant structural 
variant associated with primary treatment failure, while most
patients with germline or somatic mutation in BRCA1 or BRCA2 had
a favourable response to treatment (Extended Data Fig. 7a, b and
Supplementary Information). Only two refractory patients (2 out of
12, 16.7%) had evidence of homologous recombination deficiency, both
involving somatic methylation of BRCA1 (Extended Data Fig. 5),
consistent with the previous observation that methylation is not
functionally equivalent to germline mutation in mediating platinum
sensitivity. Although somatic mutation frequency (SNVs and indels) was
significantly lower in resistant (P < 0.05) and refractory (P < 0.01) 
cases compared to sensitive samples (Extended Data Fig. 7c, d), other 
than CCNE1 amplification and BRCA1/2 mutation, we did not observe 
notable differences in the type or frequency of driver mutations, pat- 
terns of methylation or miRNA expression in the clinical subsets 
(Supplementary Information). We investigated the approximately 
one-third of tumours with neither a discernable defect in the homo- 
logous recombination pathway, nor CCNE1 amplification or gain. Both 
progression-free and overall survival in these patients were poor rela- 
tive to the homologous recombination defective group, similar to 
CCNE1 gained/amplified patients in both our study and The Cancer 
Genome Atlas (TCGA) independent data set? (Extended Data Fig. 7b). 
There was no evidence for high expression of CCNE1 (Extended Data
Fig. 7e) or overrepresentation of other driver events in this group. We
previously reported and validated four molecular subtypes of
HGSC defined by distinct gene expression profiles and clinical
outcome. Molecular subtype analysis of the poor-prognosis,
homologous-recombination-intact/CCNE1 wild-type tumours showed an
underrepresentation of the favourable-outcome C2/immune subtype
in both our series and TCGA (Extended Data Fig. 7f).
Figure 2 | Mutational signatures of primary high-grade serous ovarian
cancer. Bars represent BRCA (blue) and age (green) signature mutation rates
for 80 primary tumours. Coloured squares indicate primary response to
chemotherapy and dominant mutational signature. Deleterious germline and
somatic mutations in the homologous recombination pathway are shown, as
well as cases with CCNE1 gain or amplification, and one case with a germline
mismatch repair gene (PMS2) mutation. Samples are grouped by mutated gene
and ordered left to right by descending BRCA signature mutation rate. Mb,
megabases.
Acquired resistance 


Surgery or biopsy is rarely performed in recurrent HGSC and so we 
relied on collection of tumour cells from ascites, drained to relieve 
abdominal distension, to examine relapsed disease. We focused on 
patients who were sensitive to initial treatment but had failed sub- 
sequent therapy (Supplementary Information). Relapse samples were 
found to have a higher mutational burden (SNVs and indels) at the 
coding and non-coding level than matched primary samples (Fig. 3a). 
In principle, this may reflect wider representation of tumour deposits
within ascites; however, few differences were observed between
primary ascites and primary tumour samples (Extended Data Fig. 8a, b).
We observed a significant relationship between the number of
non-silent coding SNVs unique to the relapse samples and the number of
courses of platinum-based chemotherapy the patient received (P =
0.046; Extended Data Fig. 8c, d), suggesting that HGSC continues to
evolve during treatment. To evaluate the effect of chemotherapy on the
HGSC genome, we investigated mutations that were unique to the 
relapse samples. We observed distinct changes in the frequency of 
mutations in certain trinucleotide contexts in the relapse samples, with 
a significant increase in C(C>T)C mutations across the 15 paired cases 
(P = 0.023) (Fig. 3b). We also observed an increase in the frequency of 
dinucleotide substitutions in resistant (post-treatment) samples 
(Supplementary Information). However, we found no evidence of plat- 
inum-induced mutations specifically occurring in driver genes. We 
interrogated non-silent coding mutations that were unique to the 
relapse samples to find those that may be involved in the development 
of resistance. We identified 32 genes that were mutated in more than 
one of the paired cases; however, none was clearly linked to chemore- 
sistance (Extended Data Fig. 8e and Supplementary Information). 
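
A label such as C(C>T)C denotes a C>T substitution at a cytosine flanked by a cytosine on each side. By the usual convention, substitutions are reported on the pyrimidine strand, so a G>A change in a GGG context is counted as C(C>T)C after reverse complementing. The sketch below illustrates that bookkeeping only; the function names and the toy reference sequence are hypothetical, and this is not the analysis pipeline used in the study.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def trinucleotide_context(reference, position, alt):
    """Label an SNV at 0-based `position`, e.g. 'C(C>T)C'."""
    if position < 1 or position > len(reference) - 2:
        raise ValueError("SNV needs one flanking base on each side")
    context = reference[position - 1:position + 2]
    if context[1] == alt:
        raise ValueError("alternate allele equals the reference base")
    if context[1] in "AG":
        # Purine reference base: report the change on the opposite strand.
        context = reverse_complement(context)
        alt = reverse_complement(alt)
    return f"{context[0]}({context[1]}>{alt}){context[2]}"


toy_reference = "ACCCGGGTA"           # hypothetical sequence
toy_snvs = [(2, "T"), (5, "A"), (7, "C")]  # (position, alternate allele)
counts = Counter(trinucleotide_context(toy_reference, p, a) for p, a in toy_snvs)
print(counts)
```

In the toy data the first two variants fall into the same C(C>T)C class even though one is a G>A change on the reference strand, which is why contexts are normalized before frequencies are compared between primary and relapse samples.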
We investigated other possible mechanisms of resistance, including 
reversion of germline mutations in BRCA1 or BRCA2 (ref. 25). We 
identified five cases with reversion mutations (Extended Data Fig. 9a), 
including two patients where more than one reversion mutation was 
identified (Extended Data Fig. 9). A notable example involved a rapid 
autopsy patient with a BRCA2 germline mutation in which five inde- 
pendent reversion events were detected at death (Supplementary 
Information). This patient was non-responsive to the PARP inhibitor 
olaparib and to carboplatin given after her disease recurrence 
(Extended Data Fig. 9c). We subsequently performed deep amplicon 
sequencing flanking the BRCA2 germline mutation and reversion 
sites in additional metastatic deposits from this patient, and identified 
at least a further seven high confidence reversion events (Fig. 4a, 
Extended Data Fig. 9d, e and Supplementary Information). We found 
no evidence of a deletor phenotype in the WGS data from this patient
that might explain the propensity for reversion of the germline
BRCA2 allele (Supplementary Information). Evaluation of reversion
allele frequency showed that several subclonal reversion events were
detected in individual deposits, particularly in the abdomen where the
presence of ascites may facilitate the spread of tumour cells
(Extended Data Fig. 9e). Reversion of mutant alleles to a wild-type
sequence has been reported previously. Minor subclones that have
reverted to a wild-type sequence would be difficult to distinguish from
contaminating stromal cells and therefore additional reversion events
may be present at a low level.

Figure 3 | Somatic mutational patterns in acquired resistant cohort.
a, Number of mutations in paired primary and relapse samples that are unique
or shared at whole genome level (top) and restricted to non-silent coding
mutations (bottom). b, Difference in mutation frequency in post-treatment,
relapse-resistant samples compared to paired samples collected earlier during
disease journey. SNVs were examined in their trinucleotide sequence context.
Asterisk indicates statistical significance of C(C>T)C using a test of equal
or given proportions across the cohort after a Fisher’s exact test (two-sided)
on individual cases (P < 0.05).
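
The caption of Figure 3 refers to a two-sided Fisher's exact test applied to individual cases. As a rough illustration of how such a test compares the share of one mutation class between two samples, the sketch below computes the two-sided P value for a 2x2 table from the hypergeometric distribution. The counts are hypothetical, chosen only to show the mechanics, and the function name is not taken from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more probable than the observed one.
    return sum(p for p in map(prob, range(lowest, highest + 1))
               if p <= p_observed * (1 + 1e-9))


# Hypothetical counts: C(C>T)C versus all other SNVs in one paired case.
relapse_unique = (40, 960)
primary = (20, 980)
p_value = fisher_exact_two_sided(*relapse_unique, *primary)
print(f"P = {p_value:.3g}")
```

With a small relative tolerance on the comparison, tables that are exactly as probable as the observed one are included in the sum, which is the usual definition of the two-sided test.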

In the second autopsy case, there was no evidence of reversion of
the BRCA1 germline allele. The primary tumour consisted of sheets of
epithelial cancer cells with limited stroma and was classified as
C2/immune subtype by transcriptome profiling, consistent with our
previous data showing enrichment of the C2/immune subtype in
BRCA1 patients. However, at autopsy, histological examination
revealed an extensive desmoplastic stromal reaction (Fig. 4b) and a
C1 subtype (Supplementary Information). Fibrotic stromal reactions
are prominent in the poor-outcome C1 molecular subtype of primary
HGSC, and have been associated with poor drug uptake and primary
chemoresistance in other solid cancers.

BRCA1 promoter methylation is an important somatic driver in
~11% of HGSCs. We identified one patient in which the primary
sensitive sample showed extensive promoter methylation and low
BRCA1 expression (Fig. 4c), while the relapse sample had lost
BRCA1 methylation and the gene was expressed at comparable levels
to homologous-recombination-intact tumours. Comparison of global
methylation patterns of primary and recurrence samples suggested a
specific rather than generalized altered methylation status at relapse
in this patient (Supplementary Information).

We observed that on average there were more structural variants 
(~1.6 times) detected in recurrence than in primary samples 
(Supplementary Table 5). We therefore searched for rearrangements 
that were unique to the post-treatment samples and that could influ- 
ence chemotherapy response. Gene breakage was identified in pro- 
apoptotic genes FOXO1 and BCL2L11, each unique to the relapse 
sample of a single case (Supplementary Table 8). Breakpoints in each 
gene were identified in a further two primary chemoresistant samples 
(Supplementary Information). 

We also sought instances of capture and rearrangement of
promoter sequences that may drive expression of a partner pro-resistance
gene. Among the 114 samples, we identified 36 events in 30 unique
genes. These included two recurrent promoter fusions involving
ABCB1, which encodes the drug efflux pump MDR1. In these
tumours, an intergenic deletion of ~250 kilobases (kb) juxtaposed,
head-to-tail, the promoter and non-coding exon 1 of SLC25A40 with 
exon 2 of ABCB1, resulting in a fused transcript (Fig. 4d and Extended 
Data Fig. 10a). There was no evidence that these events were present 
in matched primary, sensitive samples. Analysis of transcriptome data 
showed an increase in expression of ABCB1 in both recurrent samples 
with the putative fusion, and a corresponding decrease in SLC25A40, 
relative to matched sensitive samples (Extended Data Fig. 10b). 
Indeed, the expression of ABCB1 in these cancers was among the 
highest found in the cohort (Fig. 4e). Expression of a fused transcript 
for one of the samples where remaining material was available was 
confirmed by reverse transcription PCR (RT-PCR) (Extended Data 
Fig. 10c). Among the transcriptome data for the relapse samples we 
identified five additional outliers with high ABCB1 expression. Of 
these, three had translocations or insertions involving the 5’ end of 
ABCB1 (Extended Data Fig. 10a), which may have affected the regu- 
lation of ABCB1. Using primers specific for the predicted fusion, RT- 
PCR and Sanger sequencing, we detected a fusion product in one of 
these samples, suggesting the presence of an additional subclone not 
apparent at the coverage obtained by WGS (Supplementary 
Information). We evaluated an additional 51 samples collected from 
women with recurrent HGSC and found a further 4 independent
fusion events (Extended Data Fig. 10d), all involving samples with
above median ABCB1 expression (Extended Data Fig. 10e). MDR1
(encoded by ABCB1) is an efflux pump for various chemotherapeutic
agents used in the treatment of ovarian cancer including paclitaxel,
etoposide and doxorubicin. All patients with the fusion had been
exposed, and their tumours failed to respond, to one or more
MDR1 substrates (Fig. 4f).

Figure 4 | Molecular changes associated with acquired resistance. a, In total,
11 high-confidence reversion events were identified in 16 out of 18 samples
collected at autopsy in a BRCA2 carrier. Additional large deletion event
identified by WGS not shown. b, Haematoxylin-and-eosin-stained primary
(left) and autopsy (right) tumour sections showing epithelial sheets and
extensive stromal reaction, respectively. Original magnification, ×200.
c, BRCA1 promoter methylation and low expression in AOCS-091 primary
sensitive sample compared to hypomethylation and increased expression in
matched resistant relapse sample. Grey dots represent all other primary
tumours (n = 80). d, Schematic of deletion and SLC25A40-ABCB1 predicted
transcript. e, Box plots of ABCB1 expression; lines indicate median and
whiskers show range (***P < 0.001, ****P < 0.0001, two-sided t-test).
f, Treatment response in patients with SLC25A40-ABCB1 fusion. Primary
treatment (line 1) includes both surgery and chemotherapy, and CA125
response cannot distinguish between effects of the two treatments.


Discussion


We performed whole-genome analysis of a disease that is
characterized molecularly by extensive structural genomic variation and
clinically by initial chemo-sensitivity, followed by the frequent emergence
of chemotherapy resistance. Despite our finding that gene breakage
substantially increases the frequency of recurrent mutations in NF1
and RB1, few genes other than TP53 are recurrently mutated in HGSC
at a high level. The lack of driver protein-coding fusions in primary
tumour samples indicates that these are less common than previously
suggested.

While the prevailing view is that primary HGSC lacks recurrent
actionable point mutations, the situation in relapse disease is
unknown. We did not detect recurrent point mutations that are
currently actionable in relapse samples, suggesting that, at best, only
low-frequency events are likely to be uncovered using personalized
genomic evaluation of patients with recurrent HGSC. A recent analysis of
DNA changes in Caenorhabditis elegans exposed to platinum
recorded changes in pyrimidine-rich regions, similar to those we
observed here in post-treatment samples. Although there seems to
be a chemotherapy imprint on the tumour genome that includes
non-silent coding changes, these did not result in mutation of known
driver or resistance genes in our series.


We observed a range of molecular changes associated with acquired
chemoresistance. Reversion of BRCA1 and BRCA2 germline alleles has
been reported previously, and we extend these findings with a
report of an apparent reversion of a methylated BRCA1 allele in one
patient. Analogous to convergent mutation of SETD2 in renal cancer,
we found multiple reversion events in two patients, including one at
death with 12 high-confidence independent BRCA2 reversion events,
underscoring a daunting potential for evolution of resistance in HGSC
genomes. We note that this patient did not have debulking surgery and
therefore may have had a large tumour pool from which reverted clones
could arise under selection. It is of interest to determine whether the use
of neoadjuvant chemotherapy in mutation carriers increases the risk
of emergence of reverted germline alleles. A surprising feature of this
case was that multiple reversion events were found in individual
tumour deposits, whereas distinct foci with separate, largely
monoclonal events may have been expected. Furthermore, clones were shared
between metastatic sites, indicating metastasis-to-metastasis seeding
and possible cooperative interactions between clones. The multiple
reversions seen in one end-stage patient contrasted with no BRCA1
reversion in another patient, where an extensive desmoplastic stromal
reaction was observed at autopsy. Tumour desmoplasia has been
associated with poor drug uptake and chemoresistance in pancreatic
cancer, and it is possible that the stromal reaction in this patient allowed
tumour survival despite apparent retention of a
homologous-recombination-defective phenotype.

The ABCBI gene, encoding the multidrug-resistant protein 1 
(MDR1), mediates rapid efflux of many chemotherapeutic agents 
including paclitaxel”, which forms part of standard primary chemo- 
therapy in HGSC. We found up-regulation of this transporter appar- 
ently through promoter fusion and translocation involving the 5’ 
region of the gene, in approximately 8% of HGSC recurrence samples. 
Inhibition of MDR1 activity has not proven to be an effective strategy 
so far, owing to a considerable increase in paclitaxel toxicity*®°’. 
Identification of patients with this novel fusion event may allow 
clinicians to prioritize treatment with chemotherapy that is not a 
substrate of MDR1, more targeted use of MDR1 inhibitors, and use of 
new PARP inhibitors that are poor MDR1 substrates**. 

Collectively, our findings underscore the heterogeneity and appar- 
ent adaptability of the HGSC genome under the selective pressure of 
chemotherapy, and indicate that overcoming resistance to conven- 
tional chemotherapy will require a diversity of approaches. 

Note added in proof: The C > T mutations in pyrimidine-rich regions 
we observed in post-treatment samples closely resembled those associ- 
ated with temozolomide treatment of glioblastoma patients”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Patient cohort description. The study population consisted of women diagnosed 
with epithelial ovarian, primary peritoneal or fallopian tube cancer between 1992 
and 2012. The women were treated at hospitals across Australia and were 
recruited through the Australian Ovarian Cancer Study (AOCS)” or through 
the Gynaecological Oncology Biobank at Westmead Hospital in Sydney. Four 
primary refractory cases were obtained from the Hammersmith Hospital 
Imperial College (London, UK) and the University of Chicago (Chicago, USA). 
Ethics board approval was obtained at all institutions for patient recruitment, 
sample collection and research studies. Written informed consent was obtained 
from all participants in this study. 

All patients were diagnosed with serous carcinoma of high-grade (grade 2 or 
grade 3) and advanced stage (FIGO stage III or IV, International Federation of 
Gynaecology and Obstetrics) and received platinum-based chemotherapy as part 
of primary treatment. In total, 81 out of 92 (88%) patients also received a taxane at 
first line treatment; 86 out of 92 (93%) patients received a taxane during the 
course of their clinical care. 

From previously published studies we estimated that with 92 unique patients, 
we would expect to achieve 90% power for 90% of genes to detect mutations that 
occur at a frequency of ~10% above the background rate for ovarian cancer 
(assuming a somatic mutation frequency of <2 per Mb)”. 

Additionally, as normal controls, 7 fallopian tube samples were obtained from 
Dana-Farber Cancer Institute (Boston, USA). Five samples were fallopian tube 
secretory epithelial cells isolated from dissociated fallopian tube fimbriae and 
expanded in short-term culture in WIT medium (Stemgent), two samples were 
fallopian tube epithelium collected from dissociated fallopian tube fimbriae with- 
out expansion in vitro. 

The validation cohort of relapse ascites samples was collected from patients 
recruited through the AOCS with high-grade and advanced stage disease. In total, 
50 out of 51 (98%) patients received platinum-based chemotherapy as part of 
primary treatment; 46 out of 51 (90.2%) patients also received a taxane at first line 
treatment; 47 out of 51 (92.2%) received a taxane during the course of treatment. 
Clinical definitions. Progression-free survival was the time interval between 
histological or cytological diagnosis and disease progression based on GCIG 
(Gynaecological Cancer Inter Group) CA125 criteria“, imaging or clinical evalu- 
ation. Overall survival was defined as the interval between histological or cyto- 
logical diagnosis and death from any cause. A response to second and subsequent 
lines of chemotherapy was defined as at least a 50% decrease in CA125 from an 
increased pre-treatment level, confirmed and maintained for at least 28 days*’. A 
complete response to second and subsequent lines of chemotherapy was defined 
as normalization of CA125 from an increased pre-treatment level, confirmed and 
maintained for at least 28 days. 
Inclusion criteria for treatment response groups. Primary resistant and 
refractory: This group included patients with evidence of disease progression 
within ~6 months from the end of primary treatment. An additional level of 
stringency was added by only including patients that demonstrated no response 
to subsequent lines of chemotherapy. A subset were designated as refractory, 
which included patients with disease progression while on primary treatment, 
or within one month of the end of primary treatment’, evidenced by persistently 
elevated CA125 or clinical progression. 

Primary sensitive: Patients with no evidence of disease progression within 6 
months of the end of primary treatment or patients that demonstrated responses, 
either normalization of CA125 or a 50% decrease in CA125 (from an increased 
pre-treatment level), to multiple lines of platinum-based chemotherapy. 

Acquired resistance: Patients that failed to respond to chemotherapy for relapsed 
disease having previously demonstrated sensitivity to earlier lines of chemotherapy. 
Initial sensitivity was inferred if patients with macroscopic residual disease after 
initial surgery had no progression within 6 months from the end of primary treat- 
ment and/or had a complete or 50% response to chemotherapy for second and 
subsequent lines of treatment, before the development of resistance (that is, no 
response to later lines of chemotherapy) and collection of relapse sample. 

Nucleic acid isolation. DNA was isolated from peripheral lymphocytes or lym- 
phoblastoid cell lines using the salting out method for normal DNA. For tumour 
DNA and RNA isolation, frozen tissue sections were cryosectioned, with a section 
for haematoxylin and eosin staining being taken before and after serial sectioning 
for assessment of tumour content. For samples containing >70% tumour, 4 × 50 
µm and 12 × 100 µm whole frozen tissue sections were used for DNA and RNA 
extractions, respectively. For samples containing <70% tumour, needle dissection 
of tumour cells was performed on up to 100 × 10 µm frozen sections for 
DNA and RNA extractions. Tumour cells were isolated from ascites using 
Dynabeads Epithelial Enrich (Invitrogen/Life Technologies), followed by DNA 
and RNA extractions. DNA extractions were performed using the DNeasy blood 
and tissue kit (QIAGEN). DNA was quantified using the Qubit dsDNA BR assay. 
RNA was isolated using the mirVana miRNA Isolation kit (Ambion/Life 
Technologies). RNA quality was assessed using the RNA 6000 Nano kit on the 
BioAnalyzer (Agilent) and the NanoDrop was used to assess quality and deter- 
mine RNA quantity. 

SNP arrays and copy number analysis. Tumour and matched normal DNA was 
assayed with the Omni 2.5-8, V1.0 and V1.1 Illumina BeadChips as per the 
manufacturer’s instructions (Illumina). Single nucleotide polymorphism (SNP) arrays 
were scanned on an iScan (Illumina), and data were processed using the Genotyping 
module (v.1.9.4) in GenomeStudio v.2011.1 (Illumina) to calculate B-allele fre- 
quencies (BAF) and logR ratios. Genome alteration print** was used to call 
somatic regions of copy number change after low-quality probes assessed in 
the matched normal sample with GenCall (GC) score of <0.7 were removed. 
Commonly affected genes in regions with significant (q < 0.05) gain and loss were 
determined using GISTIC v2.0 (ref. 44) and genes within regions of copy number 
change were annotated according to the Ensembl v70 gene model. 
Whole-genome library and sequence generation. Tumour cellularity was 
assessed using SNP array data and qPure* indicating a median tumour content 
of 85.3% (range 49.7-99.8). Matching germline DNA from blood mononuclear 
cells or lymphoblastoid cell lines was analysed for all patients. Sequence libraries 
were generated from 1 µg of genomic DNA using the standard library preparation 
technique as in the protocol in the TruSeq DNA PCR-Free Sample Preparation 
Guide for the TruSeq DNA Sample Preparation Kit (FC-121-2001, Illumina). 
Sequencing was carried out by the Illumina Genome Network to a minimum 
average of 30-fold base coverage for germline samples and 40-fold coverage for 
tumour samples. 

WGS data processing and quality control. Each lane of sequencing data under- 
went alignment to the Genome Reference Consortium human genome 
assembly (GRCh37) using multi-threaded BWA“ 0.6.2-mt resulting in sorted 
lane level files in sequence alignment/mapping (SAM) format then compressed 
and converted to binary file (BAM) created by Samtools*” 0.1.19. Whole 
sample level merged BAMs, one each for the matched germline and tumour samples, 
were produced by in-house tools, and optical duplicate reads were marked using 
Picard MarkDuplicates 1.97 (http://picard.sourceforge.net). Quality assessment 
and coverage estimation were carried out by qProfiler and qCoverage 
(http://sourceforge.net/projects/adamajava). To test for the presence of sample 
or data swaps all sequence data were assessed for concordance at approximately 
1.4 million polymorphic genomic positions including the genotyping array data 
by qSignature. 

Transcriptome sequence data generation. Between 500 ng and 2 µg of total 
RNA (RNA integrity number >7, except for autopsy samples) was used for 
library preparation using the TruSeq RNA Sample Preparation v2 kit 
(Illumina) as per the manufacturer’s instructions using the Low-Throughput 
protocol. All libraries were sequenced as paired-end 100-bp on a HiSeq2000 
instrument (Illumina). 

Transcriptome sequence data processing and quality control. Each barcode- 
separated lane of sequencing data underwent three streams of processing. To 
assess quality each was aligned to the Genome Reference Consortium human 
genome assembly (GRCh37) using multi-threaded BWA“ 0.6.2-mt. RNA-SeQC* 
was used to investigate RNA sequencing quality and concordance to all other 
sequencing was assessed at approximately 1.4 million polymorphic genomic posi- 
tions by qSignature (http://sourceforge.net/projects/adamajava). Estimation of gene 
and transcript abundance was carried out using RSEM* for the Ensembl v70 gene 
and transcript model. The ‘expected_count’ values were transformed to log2-counts 
per million (log2CPM) and the mean-variance relationship estimated to compute 
appropriate observational-level weights using the methods available in R-package 
Limma (http://genomebiology.com/2014/15/2/R29). A quantile normalization step 
was applied to normalize between samples before downstream analysis using the 
methods available in R-package Limma. A further gapped alignment and analysis 
with Mapsplice” was also carried out in order to identify insertions and deletions as 
well as differential splicing patterns and gene fusion transcripts. 

MicroRNA expression analysis. MicroRNA expression levels were examined 
using the nCounter Human v2.1 miRNA expression analysis kit (NanoString 
Technologies), with 100 ng total RNA as input for hybridization to reporter/ 
capture probes. Purification was performed on the nCounter Prep Station and 
detection using the nCounter Digital Analyzer (NanoString Technologies), as per 
the manufacturer’s instructions. 

The nSolver Analysis Software (NanoString Technologies) was used to extract 
the raw counts for all samples that passed internal QC metrics, before normal- 
ization using the housekeeping genes to control for sample input. 

Methylation. Methylation levels across the genome were examined on the 
Infinium HumanMethylation450 BeadChip (Illumina); the samples were processed 
at the Australian Genome Research Facility. The data were background 
corrected, normalized to internal controls and quality control was performed at 
the probe and sample level. A total of 572 probes failed in greater than 25% of the 
samples owing to their detection P values being greater than 0.05, and these were 
removed from the data’!. COMBAT was used to remove batch effects”. 
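
The probe-level filter described above (a probe is removed when its detection P value exceeds 0.05 in more than 25% of samples) can be sketched as below. The function name, the input layout and the probe identifiers are hypothetical and chosen only for illustration; the actual pipeline used is not specified beyond the thresholds.

```python
def failed_probes(detection_p, p_cutoff=0.05, max_fail_fraction=0.25):
    """Return the set of probes that fail in too many samples.

    detection_p: dict mapping probe id -> list of detection P values,
                 one value per sample.
    A probe fails in a sample when its detection P value is > p_cutoff.
    """
    failed = set()
    for probe, p_values in detection_p.items():
        if not p_values:
            continue  # no data for this probe; nothing to assess
        n_fail = sum(1 for p in p_values if p > p_cutoff)
        if n_fail / len(p_values) > max_fail_fraction:
            failed.add(probe)
    return failed


# Toy example: the first probe fails in 2 of 4 samples, the second in 1 of 4
toy = {
    "probe_1": [0.01, 0.20, 0.30, 0.01],
    "probe_2": [0.01, 0.01, 0.01, 0.20],
}
to_remove = failed_probes(toy)
```

Note that a probe failing in exactly 25% of samples is retained, because the criterion is "greater than 25%".
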
Somatic SNV variant detection. High confidence somatic substitutions were identified 
as the intersect of the post-filtered output of the mutation detection tools qSNP* 
and the Unified Genotyper** (GATK). In addition, supporting evidence for variants 
was sought in matched independent and/or orthogonal data sets such as long mate 
pair sequencing data produced on Life Technologies SOLiD platform and HiSeq 
RNA transcriptome sequencing. Annotation of variants was carried out against 
Ensembl v.70 and consequence prediction using Ensembl perl API modules. Two 
methods, Mutsig® and IntOGen*’, were used to identify significant, recurrently 
mutated genes in the primary tumour data set (n = 80). 

Somatic short insertion and deletion variant detection. Pindel”’ was used to 
identify indels of 1-50 bp in length and variant filtering, annotation and prior- 
itization was carried out as previously described'*. Manual review of both germ- 
line and somatic variants in genes highlighted in this study was carried out using 
Integrative Genomics Viewer (IGV)°**. Gene model annotation of variants was 
carried out against Ensembl v.70 and consequence prediction using Ensembl perl 
API modules for variant effect prediction across all transcripts. 

Structural variant detection. Structural variants of length 50 bp or greater were 
identified using qSV'* (http://sourceforge.net/projects/adamajava), which dis- 
covers somatic structural rearrangements from tumour and matched normal 
sequencing using three methodologies: discordantly mapped read pairs; clusters 
of soft clipping; and a split contig alignment generated from localized de novo 
assembly of unmapped plus aberrantly mapped reads. Only high confidence 
somatic structural variants were used in the analysis defined as events that were 
identified by more than one of the detection methodologies or that had at least 10 
supporting reads, and where no evidence of the event was identified in the 
matched germline sample including low quality data. 
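
The high-confidence rule stated above can be expressed as a simple predicate. The sketch below is illustrative and is not the qSV implementation; the class, field and method labels are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class SVCall:
    # Detection methodologies that reported the event, for example
    # {"discordant_pairs", "soft_clip", "split_contig"} (hypothetical labels)
    methods: frozenset
    supporting_reads: int
    # True if any evidence of the event, including low quality data,
    # was seen in the matched germline sample
    germline_evidence: bool


def is_high_confidence(call, min_reads=10):
    """Apply the rule: (>1 method OR >= min_reads reads) AND no germline evidence."""
    well_supported = len(call.methods) > 1 or call.supporting_reads >= min_reads
    return well_supported and not call.germline_evidence


example = SVCall(frozenset({"discordant_pairs", "soft_clip"}), 4, False)
keep = is_high_confidence(example)
```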

Breakpoint annotation and consequence predictions were performed against the 
Ensembl v70 gene model to calculate the consequences for flanking genes or frag- 
ments of genes. Breakpoints were also integrated with the segmented copy number 
data to inform characterization: intrachromosomal breakpoints flanking a copy 
number segment of loss were annotated as deletions; those linking segments of 
similar DNA copy number were annotated as intrachromosomal translocations; 
duplications and inversions associated with increases in copy number enabled char- 
acterization of tandem duplications and amplified or foldback inversions. 

Somatic variant verification. In-house developed software examined genomic 
positions of variants in the matched data sets scoring supporting read evidence 
and assessing coverage in a similar manner to methods used in a recent genomic 
landscape paper for cervical carcinoma”. In addition for structural variants, SNP 
array data provided orthogonal evidence in support for structural variant events 
that cause a copy number shift when boundaries of copy number change were 
identified in close proximity to breakpoints. 

Mutational signatures and detection of kataegis. Mutational signatures were 
deciphered using the framework developed previously” (http://www.mathworks.com/matlabcentral/fileexchange/38724). 
Loci of localized hyper-mutation, termed 
kataegis, were identified by calculating the inter-mutation distances between the 
genomic position of sorted, high-confidence somatic substitution variant calls for 
each sample. Inter-mutation distances were then segmented using piecewise constant 
fitting to find regions of constant inter-mutation distance. Parameters used for 
piecewise constant fitting were γ = 25 and kmin = 2 (ref. 60). Putative regions of kataegis 
were identified as those segments containing six or more consecutive mutations with 
an average inter-mutation distance of less than or equal to 1,000 bp. 
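
As a rough illustration of the final selection step, the sketch below scans sorted mutation positions on one chromosome for runs of six or more mutations. It does not implement piecewise constant fitting; instead it assumes a simpler and stricter stand-in in which every consecutive inter-mutation distance in a run must be at most 1,000 bp (which guarantees that the average is also at most 1,000 bp). The function name and the toy positions are hypothetical.

```python
def kataegis_candidates(positions, max_dist=1000, min_mutations=6):
    """Return (start, end, n_mutations) for candidate hyper-mutated runs.

    positions: genomic coordinates of somatic substitutions on ONE chromosome.
    A run is a maximal stretch in which each consecutive pair of mutations
    is separated by at most max_dist bp.
    """
    pos = sorted(positions)
    regions = []
    start = 0
    for i in range(1, len(pos) + 1):
        # Close the current run at the end of the data or at a large gap
        if i == len(pos) or pos[i] - pos[i - 1] > max_dist:
            if i - start >= min_mutations:
                regions.append((pos[start], pos[i - 1], i - start))
            start = i
    return regions


toy_positions = [100, 300, 500, 700, 900, 1100, 50000, 50100, 900000]
candidates = kataegis_candidates(toy_positions)
```

A segmentation-based approach, as used in the study, is less sensitive to a single larger gap inside an otherwise dense cluster than this run-based simplification.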

Detection of breakpoint clustering. Chromosomes containing highly significant 
non-random distribution of breakpoints were identified as recently described*' 
with a stringent threshold of P < 0.00001. Chromosomes with remarkably high 
numbers of rearrangement events were identified as outliers defined as a break- 
point per megabase rate exceeding 1.5 times the length of the inter-quartile range 
from the seventy-fifth percentile for each sample with a minimum threshold of 35 
breakpoints per chromosome. Chromosomes identified to have both characteristics 
were inspected against criteria for chromothripsis” or the breakage-fusion-bridge 
cumulative rearrangement model”. Similarly, chromosomes with high numbers of 
translocations were identified with a minimum threshold of 10 translocation break- 
points per chromosome following observations of localized events. 
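
The outlier rule for chromosomes with unusually many rearrangements can be sketched as below for a single sample. This is an illustration under stated assumptions: the quartile estimator (the "inclusive" method of Python's statistics.quantiles) is a choice made here, since the text does not specify one, and the function name and toy values are hypothetical. The significance test for non-random breakpoint distribution is not reproduced.

```python
import statistics


def outlier_chromosomes(breakpoints, lengths_mb, min_breakpoints=35):
    """Flag chromosomes whose breakpoint rate is an upper outlier in a sample.

    breakpoints: dict chromosome -> number of breakpoints
    lengths_mb:  dict chromosome -> chromosome length in megabases
    A chromosome is flagged when its breakpoints-per-Mb rate exceeds
    Q3 + 1.5 * IQR and it carries at least min_breakpoints breakpoints.
    """
    rates = {c: breakpoints[c] / lengths_mb[c] for c in breakpoints}
    q1, _, q3 = statistics.quantiles(list(rates.values()), n=4, method="inclusive")
    cutoff = q3 + 1.5 * (q3 - q1)
    return sorted(
        c for c, rate in rates.items()
        if rate > cutoff and breakpoints[c] >= min_breakpoints
    )


# Toy sample with five equally sized hypothetical chromosomes
counts = {"chrA": 10, "chrB": 12, "chrC": 11, "chrD": 13, "chrE": 200}
sizes = {c: 100.0 for c in counts}
flagged = outlier_chromosomes(counts, sizes)
```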

Molecular subtyping. Tumour molecular subtypes were classified using prev- 
iously described expression profiles” as the learning set. Expression of genes 
between each high-grade serous molecular subtype was compared to identify 
subtype specific gene signatures. All genes differentially expressed at a FDR < 
5% were selected as subtype specific genes. The union of all subtype specific gene 
sets were used as the features of the classifier. The normalized data from the 
learning set and transformed data from the RNA-seq data (described in detail 
in the previous section on transcriptome data processing) were further scaled 
independently so that every gene in the two sets had zero mean and unit variance. 
Finally, the transformed gene expression profiles were used to identify the 
molecular subtypes of samples in the cohort using k-nearest neighbour classifica- 
tion algorithm. The accuracy of the classification procedure was estimated based 
on the samples of known molecular subtypes, as defined by previous microarray 
experiments”. These samples were removed from the training set and the clas- 
sification procedure repeated. Of the 19 samples with known molecular subtypes 
in the cohort, 17 were classified correctly (89.47% accuracy). 
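
The scaling and classification steps can be sketched as follows. This is a minimal pure-Python illustration, not the code used in the study: the value of k, the Euclidean distance metric, the function names and the subtype labels "A" and "B" are all assumptions made here, since the text specifies only per-gene scaling to zero mean and unit variance followed by k-nearest neighbour classification.

```python
import math
from collections import Counter


def zscore_by_gene(matrix):
    """Scale each gene (column) to zero mean and unit variance.

    matrix: list of samples, each a list of expression values per gene.
    Genes with zero variance are left centred but unscaled.
    """
    n = len(matrix)
    columns = list(zip(*matrix))
    means = [sum(col) / n for col in columns]
    sds = [
        math.sqrt(sum((v - m) ** 2 for v in col) / n) or 1.0
        for col, m in zip(columns, means)
    ]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in matrix]


def knn_predict(train, labels, query, k=3):
    """Assign the majority label among the k nearest training samples."""
    nearest = sorted(range(len(train)), key=lambda i: math.dist(train[i], query))[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]


# Toy learning set: four samples, two genes, two hypothetical subtypes
learning = zscore_by_gene([[1, 10], [2, 12], [9, 1], [10, 2]])
subtypes = ["A", "A", "B", "B"]
predicted = knn_predict(learning, subtypes, learning[3], k=3)
```

In the study the learning set and the RNA-seq data were scaled independently before classification; here a single toy matrix is used for brevity. The reported accuracy corresponds to 17/19 = 89.47%.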

Verification and validation of SLC25A40-ABCB1 fusion transcript. A cohort 
of 51 relapse ascites samples was used to examine frequency of SLC25A40-ABCB1 
fusion. RNA from ascites samples was reverse transcribed into cDNA using random 
primers (Promega) and M-MLV reverse transcriptase (Promega). Quantitative 
reverse transcription PCR was performed to examine SLC25A40 and ABCB1 
expression, with GAPDH and ACTB for normalization, using the following primers: 
ABCB1_forward 5'-GAGAGATCCTCACCAAGCGG-3', ABCB1_reverse 5'-CGAGCCTGGTAGTCAATGCT-3', 
SLC25A40_forward 5'-AGGGAACATTGGATGCATTTTT-3', 
SLC25A40_reverse 5'-AGCACTTAATTGATCATAGCAGGT-3'. 

The ΔΔCt method was used to calculate expression levels in ascites compared 
to expression in the SKOV3 cell line’. 
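
The ΔΔCt calculation can be sketched as below. This is an illustrative sketch under assumptions not stated in the text: the two reference genes are combined by taking the arithmetic mean of their Ct values, and amplification efficiency is taken as 100% (a doubling per cycle). The function name and the Ct values are hypothetical.

```python
def relative_expression(ct_target, ct_refs, ct_target_cal, ct_refs_cal):
    """Fold change of a target gene relative to a calibrator sample.

    ct_target:     Ct of the target gene in the test sample
    ct_refs:       Ct values of the reference genes in the test sample
    ct_target_cal: Ct of the target gene in the calibrator (e.g. a cell line)
    ct_refs_cal:   Ct values of the reference genes in the calibrator
    """
    delta_sample = ct_target - sum(ct_refs) / len(ct_refs)
    delta_calibrator = ct_target_cal - sum(ct_refs_cal) / len(ct_refs_cal)
    delta_delta = delta_sample - delta_calibrator
    return 2.0 ** (-delta_delta)


# Hypothetical Ct values: target gene with two reference genes
fold_change = relative_expression(24.0, [18.0, 20.0], 28.0, [18.0, 20.0])
```

With these toy values the sample ΔCt is 5 and the calibrator ΔCt is 9, so ΔΔCt is -4 and the fold change is 2^4.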

Testing and verification of the presence of the SLC25A40-ABCB1 fusion transcript 
was performed using nested PCR with primers to exon 1 of SLC25A40 and exon 3 of 
ABCB1: outer_forward 5'-CGGCTCTGTGTTGACCAAAC-3', outer_reverse 5'-TCTTTGCTCCTCCATTGCGG-3'; 
internal_forward 5'-CCCGTCACCAGGGTTATTCC-3', internal_reverse 5'-CCCCTTCAAGATCCATTCCGA-3'. 

PCR conditions were as follows: 98°C for 30 s, 30 cycles of 98°C for 10 s, 60°C 
for 30 s, 72°C for 10 s, and 72°C for 10 min. PCR product from the first PCR was 
purified using the QIAquick PCR purification kit (QIAGEN). PCR product from the 
second PCR was run on a 2% agarose gel, and bands were excised and purified using the 
Wizard SV Gel and PCR Clean-up System (Promega) before Sanger sequencing 
using internal primers. 

Examination of BRCA2 reversion events in AOCS-167. Primers to flank the 
germline and reversion events identified by WGS were used to amplify the regions of 
interest for next-generation sequencing of additional tumour samples from AOCS-167. 
Each of the gene-specific primers was generated with a common sequence (CS) 
on the 5' end. The paired end sequence (PE-CS) primers add the Illumina adaptor 
sequences and a sample-specific 10-bp barcode (BC) to the libraries. 

BRCA2_1F 5'-TCATGATGAAAGTCTGAAGAAAAATGA-3'; 
BRCA2_1R 5'-GAGCAAGACTCCACCTCAAA-3'; 
BRCA2_2F 5'-AATTTTTGCAGAATGTGAAAAGCT-3'; 
BRCA2_2R 5'-ACTAAACAGAGGACTTACCATGACT-3'; 
BRCA2_3F 5'-ACTGTTTCTATGAGAAAGGTTGTGA-3'; 
BRCA2_3R 5'-TCCAATGTGGTCTTTGCAGC-3'; 
BRCA2_4F 5'-AGCACATTCTACATAAACTGTTTCT-3'; 
BRCA2_4R 5'-TGACTTTCCAATGTGGTCTTTGC-3'; 
CS_F 5'-ACACTGACGACATGGTTCTACA-3'; 
CS_R 5'-TACGGTAGCAGAGACTTGGTCT-3'; 
PE-CS_F 5'-AATGATACGGCGACCACCGAGATCT-3'; 
PE-CS_R 5'-CAAGCAGAAGACGGCATACGAGAT-BC-3'. 

The first PCR used the CS-gene-specific primers and Phusion High-Fidelity 
DNA Polymerase (Thermo Scientific) with PCR conditions as follows: 98°C for 
30 s, 30 cycles of 98°C for 10 s, 60°C for 30 s, 72°C for 10 s, and 72°C for 10 min. 
QIAquick PCR purification (QIAGEN) was used and the product was used as a 
template for a second PCR using PE-CS primers with a unique barcode for each 
sample and PCR conditions as follows: 98°C for 10 min, 15 cycles of 98°C for 10 s, 
60°C for 30 s, 72°C for 30 s, and 72°C for 10 min. Libraries were purified using 
AMPure XP Beads and analysed on Agilent BioAnalyser High Sensitivity DNA 
chip. All 18 libraries were run on a single Illumina MiSeq flowcell. 

Sequencing data was adaptively trimmed (equivalent to the trimming per- 
formed by BWA with q = 30) then aligned to the amplicon sequences using an 
in-house bioinformatics tool (Peter MacCallum Cancer Centre Bioinformatics 
Core, unpublished). This tool utilizes the known primer sequences to align each 
read to its amplicon of origin, and where there is overlap in the forward and 
reverse reads a consensus sequence is generated using the highest quality base. FastQC 
analysis was performed on the fastq files (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/); 
the average base sequence quality was above 35 for all 
samples. mpileup files were generated from the aligned sequence using SAMtools 
0.1.18 (ref. 47) and VarScan v2.3 (ref. 62) was subsequently used to calculate the 
read depth at each position. The mean read depth ranged from 8,101-21,019 per 
sample. Indel variant calling was performed using IndelGenotyperV2 (GATK 
v1.0.4905)**, with the minimum fraction of reads with the indel set to 0.001 
and a minimum count of reads supporting the indel set to 1. All variants were 
manually reviewed in IGV. 

For verification mutation specific PCR primers were designed with 1-5 bp span- 
ning the deletion, and non-mutation specific primers used were BRCA2_2F and 
BRCA2_4R. 
BRCA2_rev1_1R 5'-TGTGTTTTCACTGTCA-3'; 
BRCA2_rev1_2R 5'-TGTGTTTTCACTGTCACA-3'; 
BRCA2_rev2_1R 5'-GGACTTACCATGACTTC-3'; 
BRCA2_rev2_2R 5'-GGACTTACCATGACTTCTC-3'; 
BRCA2_rev3_1F 5'-TTATACTTTAACAGGAA-3'; 
BRCA2_rev4_1F 5'-AAACATCAGGGAATGT-3'; 
BRCA2_rev5_1R 5'-GTGTTTTCACTGTCTT-3'; 
BRCA2_rev5_2R 5'-GTGTTTTCACTGTCTTCA-3'; 
BRCA2_rev6_1R 5'-ACCATGACTTGCAGCG-3'; 
BRCA2_rev6_2R 5'-ACCATGACTTGCAGCGAT-3'; 
BRCA2_rev7_1R 5'-TACCATGACTTGCAGT-3'; 
BRCA2_rev7_2R 5'-TACCATGACTTGCAGTG-3'; 
BRCA2_rev8_1R 5'-GACTTGCAGCTTCTCTTG-3'; 
BRCA2_rev9_1R 5'-CCATGACTTGCAGCTC-3'; 
BRCA2_rev9_2R 5'-CCATGACTTGCAGCTCTC-3'; 
BRCA2_rev10_1F 5'-GTTTTATACTTTAACAGC-3'; 
BRCA2_rev10_2F 5'-GTTTTATACTTTAACAGCTG-3'; 
BRCA2_rev11_1F 5'-CTTTAACAGGATTTGGAAT-3'; 
BRCA2_rev12_1F 5'-ACAGGATTTGGAAAAC-3'; 
BRCA2_rev12_2F 5'-ACAGGATTTGGAAAACAT-3'; 
BRCA2_rev13_1F 5'-CAGGATTTGGAAAAAT-3'; 
BRCA2_rev14_1F 5'-TTGGAAAAACATCAGC-3'; 
BRCA2_rev14_2F 5'-TTGGAAAAACATCAGCTG-3'; 
BRCA2_rev15_1F 5'-GAAAAACATCAGGGAAG-3'; 
BRCA2_rev15_2F 5'-GAAAAACATCAGGGAAGTA-3'. 

Touchdown PCR was used for mutation-specific PCR to improve sensitivity 
and specificity. PCR conditions were as follows: 98°C for 5 min, 10 cycles of 98°C 
for 10 s, 65°C for 30 s decreasing 1°C per cycle, 72°C for 10 s, 15 cycles of 98°C for 
10 s, 55°C for 30 s, 72°C for 10 s and 72°C for 10 min. PCR products were run on 
a 2% agarose gel for visual verification of the presence of a PCR product of the correct size. 
Code availability. All in house developed tools are available for download from 
http://sourceforge.net/projects/adamajava. 

Statistics. No statistical methods were used to predetermine sample size, and the 
experiments were not randomized. Further details of statistics are provided in the 
Supplementary Information. 
[Panel a: schematic of sequenced samples per patient (germline, primary, relapse and autopsy rows; tumour, ascites and metastasis sample types; sensitive or resistant status) across the primary refractory, primary resistant, primary chemosensitive and acquired resistant groups. Image not reproduced.]

b
                              Primary         Primary         Primary         P value       Acquired        All cases
                              refractory      resistant       chemosensitive                resistant
Age
  Mean                        61.7            61.3            57.6            0.188         57.7            59.7
  Range                       (52.5-74.7)     (44.5-78.4)     (39.2-73)                     (45.5-70.8)     (39.2-78.4)
Primary site
  Ovary                       8 (67%)         27 (73%)        28 (90%)        0.115         10 (83%)        73 (79%)
  Peritoneum                  4 (33%)         9 (24%)         2 (6%)                        2 (17%)         17 (18%)
  Fallopian tube              0 (0%)          1 (3%)          1 (3%)                        0 (0%)          2 (2%)
Grade
  2                           2 (17%)         4 (11%)         5 (16%)         0.745         1 (8%)          12 (13%)
  3                           10 (83%)        33 (89%)        26 (84%)                      9 (75%)         78 (85%)
  Ungraded*                   0 (0%)          0 (0%)          0 (0%)                        2 (17%)         2 (2%)
Stage
  III                         12 (100%)       31 (84%)        25 (81%)        0.300         10 (83%)        78 (85%)
  IV                          0 (0%)          6 (16%)         6 (19%)                       2 (17%)         14 (15%)
Residual disease
  No macroscopic              1 (8%)          2 (5%)          1 (3%)          0.876         1 (8%)          5 (5%)
  Macroscopic <1 cm           6 (50%)         23 (62%)        18 (58%)                      2 (17%)         49 (53%)
  Macroscopic >1 cm           5 (42%)         12 (32%)        12 (39%)                      4 (33%)         33 (36%)
  Macroscopic (size unknown)  0 (0%)          0 (0%)          0 (0%)                        5 (42%)         5 (5%)
Progression-free survival
  Number of events            12 (100%)       37 (100%)       24 (77%)        <0.0001       12 (100%)       85 (92%)
  Median months†              4.98            9.24            17.72                         21.21           10.39
  95% CI of median            (4.14-7.04)     (7.56-10.13)    (13.28-21.96)                 (13.35-26.01)   (9.67-11.47)
Overall survival
  Number of events            11 (92%)        36 (97%)        19 (61%)        <0.0001       10 (83%)        76 (83%)
  Median months†              12.92           20.22           55.86                         46.04           27.11
  95% CI of median            (6.41-34.68)    (16.31-22.49)   (45.96-69.07)                 (31.76-64.83)   (21.50-34.49)
Total cases                   12              37              31              80 (primary)  12              92

Extended Data Figure 1 | Patient cohort. a, Summary of whole-genome 
sequenced patients (n = 92) and samples (n = 114). b, Clinical characteristics 
of patients by clinical response group and acquired resistant cases without 
matching primary tumour material. *Primary tumour ungraded; diagnosis 
from ascites or pleural fluid. †Time to progression or death measured from 
diagnosis. Kruskal-Wallis, Fisher or log-rank test P values comparing 
primary tumour clinical groups reported. Median follow up time of cohort was 
97.3 months. 
Extended Data Figure 2 | Structural variants and somatic mutations. 
a, b, Scatter plot shows that the number of structural variants (a) and SNVs 
(b) detected per sample was not dependent on the purity of primary tumours 
(n = 80, grey) and ascites/relapse samples (n = 34, black). Spearman 
correlation P values indicated. c, Number of structural variants detected in 
primary tumours (n = 80) grouped by homologous recombination (HR) 
mutation status: HR wild-type, BRCA1/2 altered, and HR deficient (lines 
indicate mean; ****P < 0.0001, Kolmogorov-Smirnov test). d, e, Whole-genome 
(d) and coding mutations (e) per megabase for samples (n = 79) 
stratified by BRCA1, BRCA2 or homologous recombination pathway mutation 
status (lines indicate mean; **P < 0.01, ***P < 0.001, Kolmogorov-Smirnov 
test). Sample AOCS-166 with germline mismatch repair mutation excluded 
from analysis (see Supplementary Information section 4.3). 
Extended Data Figure 3 | Chromothripsis and breakage-fusion-bridge. 
a, Chromothripsis affecting chromosome 13, including the BRCA2 locus 
(arrow), for a primary chemotherapy-resistant tumour. A high number of 
breakpoints (structural variants) are observed with oscillations of copy number 
(CNA and LOGRR) indicating regions of retained heterozygosity (BAF) for a 
single haplotype. b, Breakage-fusion-bridge amplification is observed on 
chromosome 1 for a primary chemotherapy-resistant tumour. A cluster of 
breakpoints (structural variants), mostly inversions, on the distal p arm are 
associated with blocks of amplification (CNA). IGV review of the tumour WGS 
confirms that the telomere region has been lost. 
Extended Data Figure 4 | Expression of genes altered by structural 
variation. Scatter graphs show expression of RB1, NF1, PTEN and RAD51B 
plotted against copy number in primary tumours (n = 79; Spearman 
correlation analysis). Boxplots summarize expression by mutation type; 
lines indicate median and whiskers show range (two-tailed Mann-Whitney test; 
*P < 0.05, ***P < 0.001). Samples with somatic interrupting structural 
variants and truncating mutations are indicated. Not all structural variants are 
associated with DNA loss and translocations can be observed to produce high 
expression counts for non-functional transcripts. Expression values lower than 
the median value for tumours and controls were observed for the majority of 
samples with structural variant events in RB1, NF1, PTEN and RAD51B. 
Extended Data Figure 5 | Molecular characteristics of primary tumours. Clinical and molecular characteristics of primary tumours sorted by BRCA signature 
contribution (n = 80). 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure panels a-d not reproduced. Recoverable labels: a, x axis "Age (years)", P < 0.0001; b, x axis "CCNE1 log R", P < 0.005.]

Extended Data Figure 6 | Mutational signature associations. a, b, Age 
mutational signature association with age at diagnosis (a) and CCNE1 copy 
number (b) (Spearman correlation P value reported, n = 80). c, d, BRCA 
mutational signature association with BRCA1/2 and other homologous 
recombination pathway mutations (c) and primary treatment response 
(d) (lines indicate mean; Kolmogorov-Smirnov **P < 0.01, ***P < 0.001, 
****P < 0.0001; n = 80). 
Extended Data Figure 7 | Molecular drivers and clinical associations. 
a, Percentage (n) of primary tumours (total n = 80) affected by homologous 
recombination pathway mutations and CCNE1 copy number gains. One driver 
mutation counted for samples with more than one change, ranking mutations 
in BRCA1/2, followed by other germline, somatic, amplification, deletion and 
methylation events respectively. b, Association of driver mutation subgroup 
with overall survival in AOCS and TCGA cohorts (Kaplan-Meier analysis, P 
value calculated by Mantel-Cox log-rank test). c, d, Whole genome (c) and 
coding mutations (d) per megabase for samples stratified by primary clinical 
response group (lines indicate mean). Kruskal-Wallis test P value reported 
(*P < 0.05, **P < 0.01, Kolmogorov-Smirnov test). e, Boxplots summarize 
CCNE1 expression in different driver mutation subgroups (****P < 0.0001, 
unpaired two-tailed t-test). Middle bar, median; whiskers, data range. 
f, Proportions of gene expression molecular subtypes between driver mutation 
subgroups. HR/CCNE1− subgroup has no detected homologous 
recombination pathway mutations or CCNE1 copy number changes. 
Extended Data Figure 8 | Analysis of acquired resistance cases.
a, b, Matched primary ascites share most variants with primary tumour
samples across the whole genome (a) and for non-silent coding mutations
(b). c, d, Significant correlations are observed between the time between
collection of the primary and relapse samples and the number of lines of
platinum treatment the patient received (Pearson correlation, P = 0.0232,
two-tailed) (c), and between the number of non-silent coding mutations unique
to the relapse samples and the number of lines of platinum treatment the
patient received (Pearson correlation, P = 0.0456, two-tailed) (d). e, Genes with
recurrent non-silent coding mutations that are unique to the second sample in
the acquired resistance cohort or that are driver genes reported previously.
Extended Data Figure 9 | BRCA1/2 reversion events. a, BRCA1 and BRCA2
mutations are indicated by sample for acquired resistance cases (n = 23). Of the 
ten cases with germline mutations, five show secondary somatic mutations in 
the relapse-resistant samples that are proposed to restore gene function (that is, 
reversion). b, Two independent point mutations revert the BRCA1 nonsense 
mutation in AOCS-034. c, CA125 serum marker profile for case AOCS-167. 


d, Schematic of 12 high confidence reversion events identified in AOCS-167. 
e, Allele frequency of fifteen BRCA2 reversion events identified in deep 
amplicon sequencing across 18 relapse samples from case AOCS-167. 
Reversions observed in single deposits at low allele frequencies (=0.001) were 
considered low confidence events. 
Extended Data Figure 10 | SLC25A40-ABCB1 fusion transcript.
a, Schematic showing the locations of structural variants identified upstream or
internal to ABCB1 from WGS. b, ABCB1 and SLC25A40 expression levels in
sensitive and resistant samples of AOCS-092 and AOCS-150. c, RT-PCR
verification of SLC25A40-ABCB1 fusion transcript expression in AOCS-092
and schematic of expected RT-PCR products. d, RT-PCR results from a
validation cohort of relapse ascites identifying four additional samples with the
fusion transcript (n = 51). e, ABCB1 and SLC25A40 expression levels compared
to the SKOV3 cell line by RT-PCR in the validation cohort and selected cases.
Relapse samples with the fusion are indicated by open squares and the matched
primary samples indicated by closed squares; validation cases with the fusion
are indicated by ‘+’; the median plus median absolute deviation expression
level is indicated by the dotted line (n = 56).
A kiloparsec-scale internal shock collision in the jet of a nearby radio galaxy


Eileen T. Meyer, Markos Georganopoulos, William B. Sparks, Eric Perlman, Roeland P. van der Marel, Jay Anderson,
Sangmo Tony Sohn, John Biretta, Colin Norman & Marco Chiaberge


Jets of highly energized plasma with relativistic velocities are
associated with black holes ranging in mass from a few times that of the
Sun to the billion-solar-mass black holes at the centres of galaxies.
A popular but unconfirmed hypothesis to explain how the plasma
is energized is the ‘internal shock model’, in which the relativistic
flow is unsteady. Faster components in the jet catch up to and
collide with slower ones, leading to internal shocks that accelerate
particles and generate magnetic fields. This mechanism can
explain the variable, high-energy emission from a diverse set of
objects, with the best indirect evidence being the unseen fast
relativistic flow inferred to energize slower components in X-ray
binary jets. Mapping of the kinematic profiles in resolved jets has
revealed precessing and helical patterns in X-ray binaries,
apparent superluminal motions, and the ejection of knots
(bright components) from standing shocks in the jets of active
galaxies. Observations revealing the structure and evolution
of an internal shock in action have, however, remained elusive,
hindering measurement of the physical parameters and ultimate
efficiency of the mechanism. Here we report observations of a
collision between two knots in the jet of nearby radio galaxy 3C
264. A bright knot with an apparent speed of (7.0 ± 0.8)c, where c is
the speed of light in a vacuum, is in the incipient stages of a
collision with a slower-moving knot of speed (1.8 ± 0.5)c just
downstream, resulting in brightening of both knots, as seen in the most
recent epoch of imaging.

We obtained deep V-band imaging of 3C 264, a radio galaxy 91
megaparsecs (Mpc) from Earth, with the Advanced Camera for Surveys
(ACS) on board the Hubble Space Telescope (HST) in May 2014. The
comparatively deep ACS imaging provides a reference image to
compare against previous HST imaging for evidence of proper motions of
the four previously known optical knots within the 2″-long jet. We
localized over a hundred globular clusters in the host galaxy as a
reference system on which to register previous V-band images taken
with HST’s Wide Field Planetary Camera 2 (WFPC2) in 1994, 1996,
and 2002. The systematic error in the registration of the WFPC2
images is generally of the order of 5 milliarcseconds (mas) or less.
After aligning all images to a common reference frame, the fast proper
motion of knot B is clearly visible, as shown in Fig. 1 and
Supplementary Videos 1 and 2. Previous radio observations have
revealed that the initially narrowly collimated jet bends by about 10°
at the location marked by the yellow cross, which appears to align
well with the central axis of the jet in our imaging, and which serves as
our reference point for all measured positions.

To measure their apparent speeds, the position of each knot was
measured with a centroiding technique. In the case of knots B and C,
we also modelled the jet as a constant-density conical jet with
superimposed resolved knots, in order to measure their fluxes and positions
better, particularly in the final epoch when they appear to overlap. We
plotted the position of each knot along the direction of the jet axis
versus time, and fitted the data with a least-squares linear model. A
slope significantly larger than zero (P < 0.05) indicates significant
proper motions, and we used the conversion factor 1.442c years per
milliarcsecond to convert angular apparent speeds μ_app to
dimensionless apparent speed relative to c, β_app (see Table 1).
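
The fit and unit conversion described above can be illustrated with a minimal Python sketch. It assumes an unweighted least-squares fit (the text does not say whether the fit was weighted), and the epoch positions in the example are hypothetical; only the conversion factor 1.442 and the Table 1 proper motions come from the text.

```python
# Conversion factor quoted in the text: apparent speed (in units of c)
# per unit proper motion (mas per year) for 3C 264.
C_PER_MAS_PER_YR = 1.442


def fit_slope(times, positions):
    """Ordinary (unweighted) least-squares slope of position against time."""
    n = len(times)
    t_mean = sum(times) / n
    p_mean = sum(positions) / n
    num = sum((t - t_mean) * (p - p_mean) for t, p in zip(times, positions))
    den = sum((t - t_mean) ** 2 for t in times)
    return num / den


def beta_app(mu_mas_per_yr):
    """Convert a proper motion in mas/yr to an apparent speed in units of c."""
    return C_PER_MAS_PER_YR * mu_mas_per_yr


# Hypothetical knot positions (mas) at four epochs, for illustration only.
epochs = [1994.0, 1996.0, 2002.0, 2014.0]
positions = [100.0, 110.0, 140.0, 200.0]

mu = fit_slope(epochs, positions)
print(f"proper motion = {mu:.2f} mas/yr, beta_app = {beta_app(mu):.2f}")

# Proper motions reported in Table 1 for knots B and C.
print(f"knot B: {beta_app(4.85):.1f}c, knot C: {beta_app(1.27):.1f}c")
```

Applied to the Table 1 proper motions, the conversion gives apparent speeds close to the quoted 7.0c and 1.8c.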

We found that knots A and D have an apparent speed β_app
consistent with zero (Extended Data Figs 1 and 2), while the inner knots B and
C have β_app = 7.0 ± 0.8 and 1.8 ± 0.5, respectively (Fig. 2). The value
for knot B exceeds the fastest speeds measured in the jet in the nearby
radio galaxy M87, the only other source for which speeds on kiloparsec
(kpc) scales have been measured. The current difference in speeds
between knots B and C puts them on a collision course, an interaction


Figure 1 | A comparison of HST images of the jet in 3C 264 from 1994 to 
2014. The galaxy and core emission from 3C 264 have been subtracted, with a 
white star representing the location of the black hole, and a yellow cross 
indicating the location of a bend in the jet seen with radio interferometry. The
contours show the 30% flux-over-background isophotes around each peak and 
are overlaid along with vertical guidelines to aid the eye. The first three images 
were taken with WFPC2, and the final epoch is a deep ACS/WFC image taken 
for the purposes of measuring proper motions in the jet. 
Table 1 | Summary of knot positions and speeds

          Distance along the jet (mas)      Proper motion
          Initial (1994)    Final (2014)    μ_app (mas yr⁻¹)    β_app
Knot A    148 ± 5           149 ± 1         0.07 ± 0.20         0.1 ± 0.3
Knot B    257 ± 5           359 ± 2         4.85 ± 0.58         7.0 ± 0.8
Knot C    450 ± 6           478 ± 2         1.27 ± 0.32         1.8 ± 0.5
Knot D    582 ± 5           581 ± 1         −0.13 ± 0.31        −0.2 ± 0.5

Knots A, D by contour method; Knots B, C from maximum-likelihood model.
The distance along the jet is measured from the bend in the radio jet noted in Fig. 1. β_app = v/c, where v is
velocity.


which has already begun in the final epoch from 2014, in which the 
knots begin to overlap (Fig. 2). 

In the internal shock model, the collision of two components results 
in particle acceleration, which will manifest in a significant brightening 
of the components as they combine into a single moving component. 
Our modelling results show that in the final 2014 epoch, both knots B 
and C brighten at the same time by approximately 40% over the mean 
flux level of the previous three epochs (Fig. 3). The brightening can also 
be seen in Extended Data Fig. 3, where we show the flux contour along 
the length of the jet (averaged transversely over a distance of 0.1”) for 
each epoch. 

The flux increase is also corroborated by measuring the flux within a
circular aperture centred on and enveloping knots B and C in all
epochs, which shows a significant total flux change of approximately
20%-40% over prior epochs. Under equipartition, the cooling length
for the optical-emitting electrons for knots B and C is longer than the
distance they travelled in our observations, consistent with the lack of
any decay in flux levels for these knots over the first three epochs. This
is not the case for stationary knot A, which can be seen to decay with a
timescale of around 70 years. Knot A appears to be analogous to knot
HST-1 in the M87 jet; the latter is thought to be a stationary
reconfinement shock where the jet pressure drops below that of the external
environment. The event that energized knot A may have been the
passage of fast-moving knot B in about 1971, yielding a timescale
comparable to the knot A decay time.
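
As a rough check on the epoch at which knot B would have passed knot A, the Table 1 values can be extrapolated backwards in time. This is only a sketch: it assumes a constant proper motion and takes the first epoch as 1994.0, neither of which is stated exactly in the text.

```python
# Values from Table 1 (positions in mas along the jet, proper motion in mas/yr).
KNOT_A_POSITION = 148.0        # stationary knot A
KNOT_B_POSITION_1994 = 257.0   # knot B in the first epoch
KNOT_B_PROPER_MOTION = 4.85    # best-fit proper motion of knot B


def passage_year(target_pos, pos_at_epoch, epoch, proper_motion):
    """Year at which a knot with constant proper motion was at target_pos."""
    return epoch - (pos_at_epoch - target_pos) / proper_motion


# Assumption: the 1994 observation is treated as epoch 1994.0.
year = passage_year(KNOT_A_POSITION, KNOT_B_POSITION_1994, 1994.0,
                    KNOT_B_PROPER_MOTION)
print(f"knot B would have passed knot A around {year:.1f}")
```

The extrapolation lands in the early 1970s, in line with the passage date quoted in the text.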

Previous radio observations of the 3C 264 jet show that the locations
of the knots in contemporaneous radio and optical imaging are very
similar. The earliest radio image of 3C 264 taken in October 1983
with the VLA clearly shows three distinct features (see contour
comparison in Extended Data Fig. 4). While knot D appears to be
completely stationary over this time-frame, the data suggest the possibility
Figure 2 | Position versus time for knots B and C. The position of the centre
of each knot is noted by thick red and blue lines, respectively, with line extent
corresponding to the 3σ χ² modelling error plus the systematic error on the
mean from the image registration. The grey extensions show the best-fit size of
the resolved knot from the model. As shown, the best-fit linear slope yields a
speed of (7.0 ± 0.8)c for knot B and (1.8 ± 0.5)c for knot C. In the final epoch the
knots are directly adjacent and possibly overlapping, within the errors.
Figure 3 | Change in optical flux at 6,000 Å in the colliding knots B and C
over 20 years. The knots show a simultaneous significant increase in flux in the
2014 image. Errors are the 3σ χ² modelling error.


that knots B and C were moving faster in the past and may have 
decelerated (Extended Data Fig. 5). 

In the internal shock model, the efficiency η of the conversion of the
dissipated kinetic energy E_diss into radiation is generally unknown. For
the collision in 3C 264 we can estimate it from our observations
(further details are in the Methods): we take the minimum possible value of
the Lorentz factor Γ_B = 7.1 for knot B so that the jet is observed at an
angle of 8.1°, and for slower knot C we take Γ_C = 2.8. We assume an
equipartition magnetic field and one proton per electron in the plasma
to obtain masses of 2.5 × 10³⁰ g and 6.7 × 10³⁰ g for knots B and C.
After the collision, the single combined component will move with
Lorentz factor Γ_m = 3.7. The total energy dissipated is taken as the
difference in kinetic energy before and after the collision and is E_diss =
2.6 × 10⁵¹ erg. If the knot B+C complex stays at the flux level observed
in 2014 throughout the duration of the collision (which we take to be
the knot superposition time of 30 years in the observer’s frame, as
one would expect if the cooling time of the optically emitting electrons
is shorter than the collision time), the efficiency of conversion is only
η = 10⁻³, considerably lower than is usually assumed in models.
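
The Lorentz factors and viewing angle quoted above follow from the apparent speeds. A minimal Python sketch is given here, assuming the minimum Lorentz factor relation (β_app² + 1)^(1/2) and the standard expression β_app = β sin θ/(1 − β cos θ) given in the Methods; the function names are illustrative and not part of the original analysis.

```python
import math


def min_lorentz_factor(beta_app):
    """Smallest Lorentz factor compatible with an apparent speed beta_app."""
    return math.sqrt(beta_app ** 2 + 1.0)


def angle_for_min_lorentz_factor(gamma):
    """Viewing angle (degrees) at which gamma is the minimum: sin(theta) = 1/gamma."""
    return math.degrees(math.asin(1.0 / gamma))


def lorentz_factor_at_angle(beta_app, theta_deg):
    """Invert beta_app = beta sin(theta) / (1 - beta cos(theta)) for beta, then gamma."""
    theta = math.radians(theta_deg)
    beta = beta_app / (math.sin(theta) + beta_app * math.cos(theta))
    return 1.0 / math.sqrt(1.0 - beta ** 2)


gamma_b = min_lorentz_factor(7.0)                 # knot B
theta = angle_for_min_lorentz_factor(gamma_b)     # jet angle to the line of sight
gamma_c = lorentz_factor_at_angle(1.8, theta)     # knot C at the same angle

print(f"Gamma_B = {gamma_b:.1f}, theta = {theta:.1f} deg, Gamma_C = {gamma_c:.1f}")
```

With the Table 1 apparent speeds this gives values close to the quoted Γ_B = 7.1, θ = 8.1° and Γ_C = 2.8.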

This, however, is a lower limit for two reasons. First, theoretical
modelling of internal shocks suggests that the flux should steadily
rise to a peak occurring halfway through the collision, which would
increase η. Second, it is possible that the cooling time of the optically
emitting electrons is longer than the collision time, which would
increase the duration of the elevated optical–ultraviolet emission,
and therefore η. The rate of cooling depends on the magnetic field
in the shocked plasma. Long-term monitoring of the collision in 3C
264 over the coming decades could probe the evolution of the flux and
hence constrain two free parameters of the internal shock model: the
fraction of E_diss that goes to radiating electrons and the fraction that
goes to generating the magnetic field in the shocked plasma.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS

HST data analysis. These methods are essentially identical to those used in the
previous study of M87, and we refer to ref. 21 for more extensive details beyond
the outline given here. The imaging used for this study consisted of two orbits
of F606W ACS/WFC imaging obtained in 2014 (eight exposures of approximately
600 s each, proposal ID 13327), and the following WFPC2 planetary camera
(PC) imaging: nine 140-s exposures in the F702W filter in 2002 (proposal ID
9069), one 350-s and one 400-s exposure taken in the F791W filter in 1996 (proposal
ID 5927), and two 140-s exposures in the F702W filter taken in 1994 (proposal
ID 5476).

To build the reference frame, in each ACS exposure approximately 100
common globular clusters were detected and localized using a point spread
function (PSF) peak-fitting routine. Taking into account the filter-dependent
geometric distortion, a linear six-parameter transformation solution that maps
these positions into a common master frame was found for each individual
ACS exposure. The final systematic error in the registration of the ACS
exposures is about 0.007 pixels or 0.18 mas. The WFPC2/PC imaging was similarly
matched to the master reference frame, using approximately 10-15 bright
globular clusters per raw image. The final systematic error in the registration
of the WFPC2/PC images is about 0.14 pixels or 3.5 mas. We
include a systematic error of 5 mas for all WFPC2 images and 0.2 mas for the
ACS image, added in quadrature to the measured positional errors in each
epoch. For each of the four epochs, the transformation solutions were used to
stack the exposures into a mean image (25 mas per pixel resolution),
correcting for exposure time differences (scaled to the longest exposure), and
iteratively sigma-clipping to reject cosmic rays (when more than two images were
used). The final image stacks are referred to below by the year of the
observations (1994, 1996, 2002, and 2014 images).
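
The registration errors above can be cross-checked with simple arithmetic. The sketch below converts the quoted pixel errors to milliarcseconds using the 25 mas per pixel scale of the stacked images, and shows how a systematic error is combined in quadrature with a measurement error; the 2 mas measurement error is a hypothetical value used only for illustration.

```python
import math

PIXEL_SCALE_MAS = 25.0  # reference-frame pixel scale quoted in the text


def pixels_to_mas(n_pixels):
    """Convert a length in reference-frame pixels to milliarcseconds."""
    return n_pixels * PIXEL_SCALE_MAS


def add_in_quadrature(*errors):
    """Combine independent errors as the square root of the sum of squares."""
    return math.sqrt(sum(e * e for e in errors))


acs_error_mas = pixels_to_mas(0.007)    # ACS registration error
wfpc2_error_mas = pixels_to_mas(0.14)   # WFPC2/PC registration error
print(f"ACS: {acs_error_mas:.2f} mas, WFPC2/PC: {wfpc2_error_mas:.1f} mas")

# Hypothetical 2 mas centroid error combined with the adopted 5 mas
# WFPC2 systematic error.
total = add_in_quadrature(2.0, 5.0)
print(f"combined positional error: {total:.1f} mas")
```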

The host galaxy of 3C 264 is elliptical, with a light distribution increasing 
smoothly towards the centre up to the inner 0.9″, where a dust disk is clearly
present and absorbing some of the galaxy light. The host galaxy light outside
the dust disk was modelled individually for the 1996, 2002, and 2014 images using 
the IRAF (image reduction and analysis facility) software tasks ellipse and bmodel, 
and applying a selective mask to avoid the regions of the jet and globular clusters 
(Extended Data Fig. 6). The 2002 model was also used for the 1994 epoch since the 
former is deeper. The dust disk was modelled individually for each epoch, by 
measuring the flux along progressively smaller contour line circles around the 
centre of the galaxy. The model flux was taken to be a moving average around 
any individual pixel, interpolating over masked-out areas. The dust disk was 
modelled up to two pixels (50 mas) from the centre of the galaxy. Inside this radius 
it is very difficult to reliably separate the core of the jet (a point source) from the 
galaxy owing to the irregular dust disk, which prevents fitting of the galaxy with a 
Sérsic or exponential model. Since the core flux level is not of primary interest for 
this study, we simply take the image values inside this radius as the model, and 
then subtract the entire model of galaxy and dust disk from our images for each 
epoch. Finally, we scaled all of the images to flux units (in μJy) at a reference
wavelength of 6,000 Å using the IRAF task calcphot, assuming a power-law form
(f_ν ∝ ν⁻¹), E(B − V) = 0.02, and making the correction for sensitivity changes
between decontamination events.

Proper motions. For all knots (A-D) and all epochs, we applied the same contour 
technique to measure the locations of each knot as applied in the M87 study,
except that the contour line levels were drawn at 30% flux over background. In the 
final 2014 epoch, knots B and C were inside a single contour. The knots are also 
resolved and cannot be easily modelled as point sources to calculate their flux level. 
To better measure the fluxes of knots and obtain independent locations for the 
knots, we modelled the jet as a two-dimensional image, focusing only on the region 
of knots A, B and C. The model, described below, has 14 parameters, and our 
maximum-likelihood optimization yields an estimate of the knot flux and location 
relative to the core. 

Maximum-likelihood modelling of the 3C 264 jet. In all epochs, the model 
was optimized to match the two-dimensional image over a subset of the 
image plane centred on knots B and C (Extended Data Fig. 7), covering a 
total of 226 reference frame pixels, which we call the ‘image box’. We fixed 
the central jet axis at a position angle (North through East) of 40.43°, running 
from one end of the image box to the other. The ‘background’ jet is modelled 
as a cone parameterized by radius r_1 at the distance from the core coinciding
with the start of the image box and increasing linearly to r_2 at the other end of
the box. The intensity of the jet (in μJy per pixel) is characterized by a single
value I_jet. The effective intensity at any given point changes both linearly
along the jet (in such a way as to mimic an expanding jet with a constant
rate of flow) and laterally from the jet axis, to mimic a decreasing
line-of-sight depth through the jet. Given any point inside the defined jet area, we
have w_jet as the radius of the jet at that distance from the core, and x_jet as the
perpendicular distance of the particular point from the axis. The specific
intensity is thus

$$I = I_{\rm jet}\,\frac{1}{w_{\rm jet}^{2}}\,\frac{\sqrt{w_{\rm jet}^{2} - x_{\rm jet}^{2}}}{w_{\rm jet}} \qquad (1)$$


On top of this jet, we model knots A, B, and C as circular disks
parameterized by radii (r_A, r_B, r_C), distances from the core (d_A, d_B, d_C), and flux
densities in μJy per pixel (I_A, I_B, I_C), each of which is constant over the disk and zero
outside. Knots B and C were also allowed to move in the transverse direction
perpendicular to the jet axis. Knot D is not included because the image box
does not include this area. The full jet model thus has 14 free parameters. The
simulated image is created by adding the contribution of each of the four
components to each individual pixel, before applying the relevant PSF.

To accurately fit the jet model to the image data, the simulated image must be
convolved with an appropriate PSF, which is different for each image epoch. We
used the TinyTim software package to create model PSFs for both WFPC2/PC
and ACS/WFC using the appropriate filter to match the observations. We chose a
roughly central pixel location, (400, 400) for WFPC2/PC or (1,000, 3,000) for
ACS/WFC, standard focus, and a power-law object spectrum with F_ν ∝ ν⁻¹, the
spectral index of −1 being that previously observed for the knots in the jet. For
the WFPC2/PC PSFs, the initial PSF was subsampled by a factor of ten before
being rebinned into the WFPC2/PC native PC scale, smoothed with the
recommended 3 × 3 pixel Gaussian, and finally resampled onto the 0.025″ pixel scale of
our stacked images. Optimal values of the 14 free parameters which lead to a
simulated image that best fits the data were found by the method of maximum
likelihood. We treated each pixel as an independent Poisson trial with a predicted
value λ_i and measured value x_i, where

$$\lambda_i = g\left(\frac{p_i}{a_{\rm scale}} + b_i\right) \qquad (2)$$

and

$$x_i = g\left(\frac{f_i}{a_{\rm scale}} + b_i\right) \qquad (3)$$


Here g is the gain (g = 7 electrons per DN, where DN is data number, for the
WFPC2/PC images, g = 1 for the ACS/WFC images) and p_i is the predicted flux in
the simulated image pixel. The constant a_scale is the scaling factor, specific to each
epoch, which was originally used to multiplicatively scale the images from units of
DN or electrons to flux in μJy, while b_i is the (unscaled) background estimation
that was subtracted from the images before scaling. The same equation is used to
calculate x_i except that we use the measured flux in each pixel f_i rather than the
simulated prediction.

The maximum-likelihood method requires that we maximize the likelihood
function, which is simply the product of the probability density values for each
pixel. Because the number of total counts is large (typically ≫ 1,000) and in order
to accurately calculate the likelihood contribution for each pixel, we use the normal
approximation to the Poisson probability distribution in calculating the likelihood.
We also chose to maximize the log of the likelihood function, which is equivalent
to maximizing the likelihood but more practical to implement. Given the above
definitions of λ_i and x_i for the ith pixel in the image box, the contribution to the
likelihood is simply

$$\mathrm{pr}_i = \frac{1}{\sqrt{2\pi\lambda_i/n}}\,\exp\left[-\frac{(\lambda_i - x_i)^{2}}{2\lambda_i/n}\right] \qquad (4)$$


where n is the total number of images in the stack. The final log-likelihood function 
to be maximized is thus 


L= do log (pri) (5) 


where f is the correction for the different pixel scales between WFPC2/PC and
ACS/WFC and our super-sampled reference frame described in the methods, with
f = 3.3856 for WFPC2/PC and f = 3.9204 for ACS/WFC.
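
The per-pixel likelihood described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' code: it assumes that fluxes are converted back to counts as g(flux/a_scale + b), that each pixel contributes a Gaussian with mean λ_i and variance λ_i/n, and it leaves out the pixel-scale correction f. All numerical inputs are hypothetical.

```python
import math


def to_counts(flux, a_scale, background, gain):
    """Convert a scaled flux back to counts: g * (flux / a_scale + b)."""
    return gain * (flux / a_scale + background)


def log_likelihood(model_flux, measured_flux, background, a_scale, gain, n_images):
    """Sum of per-pixel Gaussian log-probabilities with variance lambda_i / n."""
    total = 0.0
    for p, f, b in zip(model_flux, measured_flux, background):
        lam = to_counts(p, a_scale, b, gain)   # predicted counts
        x = to_counts(f, a_scale, b, gain)     # measured counts
        var = lam / n_images
        total += -0.5 * math.log(2.0 * math.pi * var) - (lam - x) ** 2 / (2.0 * var)
    return total


# Hypothetical three-pixel example; every number here is made up.
measured = [1.00, 1.20, 0.90]
background = [50.0, 50.0, 50.0]
good_model = [1.00, 1.20, 0.90]
poor_model = [0.50, 0.60, 0.45]
settings = dict(background=background, a_scale=0.01, gain=7.0, n_images=9)

ll_good = log_likelihood(good_model, measured, **settings)
ll_poor = log_likelihood(poor_model, measured, **settings)
print(ll_good > ll_poor)
```

An optimizer would vary the model parameters, regenerate the simulated image, and keep the parameter set with the largest log-likelihood.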

Although a number of approaches are possible for finding the optimal values of
the 14 free parameters that will maximize ℒ, we chose to use the freely available
Simulated Annealing code, which is well-suited to multi-parameter
optimizations of non-smooth functions that are likely to have many false maxima ‘traps’.
All 14 parameters were allowed to roam inside a wide range of reasonable values,
and we confirmed that the final optimal value did not show signs of running into
upper or lower bounds.

The final results of the jet modelling are given in Extended Data Table 1. We 
have used the data for knots B and C only in the main part of the paper, as the 
subtraction of the background on the inner side of knot A is somewhat unreliable 
due to the nearness of the core, leading us to place one edge of the image box over 
the inner side of knot A. Knot A was thus included in the model more for 
improving the fit of the other components than for measurement. In the main text,
location values for knots A and D are taken from the contour method, while those for
knots B and C are taken from the best-fit model (radial positions only). Note that
the transverse motions are only marginally significant once the systematic
registration error is included, except in the final epoch where both knots are slightly
deviated to the North.

Errors for each parameter were calculated by freezing all other parameters to
their best-fit values, and progressively increasing (or decreasing) the value of the
parameter of interest. This allowed us to plot the change in the likelihood function
Δℒ versus the parameter value. The shape of this function is roughly parabolic
with a peak at the optimal parameter value. The distribution of 2Δℒ is a χ² variable
with one degree of freedom. Thus an upper and lower error value can be calculated
by finding the parameter value corresponding to a 3σ value of 2Δℒ. The errors
reported in Extended Data Table 1 are the 3σ errors. Since the errors are two-sided,
the criterion used was that 2Δℒ = 10.27288. Fluxes were calculated by multiplying
the flux density best-fit value by πr², where r is the best-fit radius, and appropriately
propagating the errors.
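
The threshold 10.27288 can be checked numerically. The sketch below assumes one interpretation of the criterion: that the χ² tail probability (one degree of freedom) at the threshold equals the one-sided 3σ Gaussian tail probability, so that each side of the two-sided error carries that probability. It uses only the standard-library complementary error function.

```python
import math


def chi2_tail_1dof(x):
    """P(chi-squared with 1 degree of freedom > x)."""
    return math.erfc(math.sqrt(x / 2.0))


def one_sided_gaussian_tail(n_sigma):
    """P(standard normal > n_sigma)."""
    return 0.5 * math.erfc(n_sigma / math.sqrt(2.0))


threshold = 10.27288
p_chi2 = chi2_tail_1dof(threshold)
p_gauss = one_sided_gaussian_tail(3.0)
print(f"chi2 tail at threshold: {p_chi2:.6f}")
print(f"one-sided 3-sigma tail: {p_gauss:.6f}")
```

The two probabilities agree closely, which is consistent with this reading of the criterion.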

The flux increase implied by the modelling results can be checked using
aperture photometry. We used a 6.5-pixel circular aperture, centred on knots B and C
so that both are almost entirely contained within the aperture during all four
epochs without moving it. Using the 2014 modelling results, we estimate that
the smooth-jet contribution to the flux in this aperture is 17.4 μJy. The total fluxes
attributed to the jet for the four epochs are 7.9 μJy, 7.8 μJy, 6.7 μJy, and 9.4 μJy,
indicating a 20%-40% flux change in the final epoch. The estimated error on these
fluxes is approximately 0.3 μJy, where errors from the Poisson noise (0.2 μJy and
0.05 μJy for WFPC2 and ACS respectively), irregularities in the dust disk (0.15
μJy), and possible contributions of knot A to the aperture (0.2 μJy) have been
added in quadrature. Note that these estimates do not correct for the flux which
falls outside the aperture, and are thus slightly less than the totals for knots B and C
taken from Extended Data Table 1.
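
The quoted 20%-40% change and the roughly 0.3 μJy error can be reproduced from the numbers in this paragraph. The following sketch uses only the four aperture fluxes and the three WFPC2 error terms given above; the choice of comparing against the minimum, mean and maximum of the earlier epochs is an illustrative assumption.

```python
import math

# Aperture fluxes (microjansky) for the 1994, 1996, 2002 and 2014 epochs.
fluxes = [7.9, 7.8, 6.7, 9.4]
earlier, final = fluxes[:3], fluxes[3]


def percent_change(new, old):
    """Fractional change of new relative to old, in per cent."""
    return 100.0 * (new - old) / old


change_vs_max = percent_change(final, max(earlier))
change_vs_mean = percent_change(final, sum(earlier) / len(earlier))
change_vs_min = percent_change(final, min(earlier))
print(f"flux change: {change_vs_max:.0f}% to {change_vs_min:.0f}% "
      f"({change_vs_mean:.0f}% relative to the mean)")

# WFPC2 error budget: Poisson noise, dust-disk irregularities, knot A.
wfpc2_error = math.sqrt(0.2 ** 2 + 0.15 ** 2 + 0.2 ** 2)
print(f"combined WFPC2 flux error: {wfpc2_error:.2f} microjansky")
```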
The extended baseline with VLA observations. 3C 264 was observed in October 
1983 at U band (14.9 GHz) in A configuration as part of program AB239. The 
quasar 3C 286 was used as the primary flux density calibrator source while QSO 
1144+402 was used for a primary phase calibrator and secondary flux density 
calibrator. The observations used two intermediate frequencies in each band, each 
with a bandwidth of 50 MHz, and the average of the two was used to produce the 
final maps. The reduction of the 1983 U-band data followed closely the
recommended procedure in chapter 4 of the AIPS Cookbook
(http://www.aips.nrao.edu). Very little flagging was required. Once calibrated, a clean map was
made with several rounds of self-calibration, and deconvolution using the AIPS 
tasks clean and calib. We used the ‘robust’ or Briggs’ weighting scheme to weight 
the visibilities. 

The earlier radio image clearly shows three distinct features (see contour com- 
parison in Extended Data Fig. 4). In the case of knot B, a quadratic fit to the HST 
and VLA data suggests that in late 1983 the knot had a speed of 10.2c, slowing by 
0.16c per year to reach 5.6c at the beginning of 2014. While a quadratic fit is very 
poor for the combined data on knot C, a linear fit between the 1983 and 1994 
epochs similarly suggests a speed of 9.8c. Strong conclusions are prevented by the 
disparity in imaging frequencies and the large errors on the radio positions. 
Theoretical considerations. We assume that the flow consists of an
electron-proton plasma, with one proton per electron, in energy equipartition with the
comoving magnetic field. We derive the equipartition magnetic field values at
knots B and C using standard expressions including beaming. We use the
2002 flux values at 22.46 GHz of 4.4 mJy and 4.1 mJy, and diameters of 39.6 pc
and 48.9 pc, respectively, for knots B and C, as well as the observed radio spectral
index of α = 0.5. We assume that the knots are spherical, noting that the measured
diameter in the comoving frame for a spherical blob equals that in the observer’s
frame. We assume a low-frequency cutoff of 10 MHz in the observer’s frame. We
derive values of 1.2/δ mG and 1.0/δ mG (where δ is the Doppler factor) for knots B
and C respectively. These values are nearly identical to the values given in ref. 23
once corrected for beaming (they assumed a jet angle to the line of sight θ = 50°,
which has now been ruled out by the superluminal motions we observe).

We calculated masses m_B = 2.5 × 10³⁰ g and m_C = 6.7 × 10³⁰ g for knots B and
C respectively, through m = 2U_B V/c², where U_B is the energy density of the
magnetic field in equipartition and V is the volume of the knot in the comoving
frame. Using these masses and the Lorentz factors Γ_B = 7.1, Γ_C = 2.8 (as derived
by the relation Γ_B,min = (β_B,app² + 1)^(1/2), and assuming that Γ_B = Γ_B,min, which
sets the jet angle θ to the line of sight to θ = arcsin(1/Γ_B) = 8.1°), the Lorentz factor
Γ_m of the single component resulting from the collision is

$$\Gamma_m = \left(\frac{m_B\Gamma_B + m_C\Gamma_C}{m_B/\Gamma_B + m_C/\Gamma_C}\right)^{1/2} = 3.7 \qquad (6)$$

as found by conservation of momentum. The energy that will be dissipated
cumulatively throughout the collision is

$$E_{\rm diss} = m_B(\Gamma_B - \Gamma_m)c^{2} + m_C(\Gamma_C - \Gamma_m)c^{2} = 2.6\times10^{51}\ \mathrm{erg} \qquad (7)$$


Assuming that the collision lasts from the moment the collision commences up
to the point where the back end of the fast component has reached the back end of
the slow component, the duration of the collision in terms of coordinate time in
the galaxy frame is Δt = 2R′/[Γ_B(β_B − β_C)c] ≈ 340 years, where R′ is the
comoving radius of the fast component, while the duration for the observer is
Δt_obs = 2R′/[Γ_B Γ_C δ_C(β_B − β_C)c] ≈ 25 years, in approximate agreement with the
30-year time required for the knots to superimpose based on the observed HST
proper motions. The power dissipated in the galaxy frame is then P_diss = E_diss/Δt =
2.4 × 10⁴¹ erg s⁻¹. If the dissipated energy was completely converted to radiation,
the solid-angle integrated luminosity in the galaxy frame L_4π would be equal to
P_diss. We can estimate the current L_4π due to the collision by using the flux increase
f_coll in knots B and C from 2002 to 2014, to obtain the luminosity increase
assuming isotropy L = 1.3 × 10⁴⁰ erg s⁻¹, which corresponds to a solid-angle integrated
luminosity L_4π = LΓ_m²/δ⁴ = 1.6 × 10³⁸ erg s⁻¹. The efficiency of converting the
available energy to radiation is then η = L_4π/P_diss ≈ 10⁻³; as explained in the main
text, this is a lower limit.
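
The energy budget of the collision can be retraced numerically from the quoted masses and Lorentz factors. The sketch below assumes the standard inelastic two-body expressions for the merged Lorentz factor and the dissipated energy (rest-mass energy times the change in Lorentz factor), and takes the 340-year duration and the solid-angle integrated luminosity of 1.6 × 10³⁸ erg s⁻¹ directly from the text rather than deriving them.

```python
import math

C_CM_PER_S = 2.998e10       # speed of light
SECONDS_PER_YEAR = 3.156e7

# Quoted masses (g) and Lorentz factors of knots B and C.
m_b, m_c = 2.5e30, 6.7e30
gamma_b, gamma_c = 7.1, 2.8


def merged_lorentz_factor(m1, g1, m2, g2):
    """Lorentz factor of the single component left after the collision."""
    return math.sqrt((m1 * g1 + m2 * g2) / (m1 / g1 + m2 / g2))


def dissipated_energy(m1, g1, m2, g2):
    """Kinetic energy lost in the collision, in erg."""
    g_m = merged_lorentz_factor(m1, g1, m2, g2)
    return (m1 * (g1 - g_m) + m2 * (g2 - g_m)) * C_CM_PER_S ** 2


gamma_m = merged_lorentz_factor(m_b, gamma_b, m_c, gamma_c)
e_diss = dissipated_energy(m_b, gamma_b, m_c, gamma_c)
p_diss = e_diss / (340.0 * SECONDS_PER_YEAR)   # galaxy-frame duration from the text
eta = 1.6e38 / p_diss                          # luminosity taken from the text

print(f"Gamma_m = {gamma_m:.2f}")
print(f"E_diss = {e_diss:.2e} erg, P_diss = {p_diss:.2e} erg/s")
print(f"efficiency = {eta:.1e}")
```

The results land close to the quoted values, with an efficiency of order 10⁻³.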

The cooling length of a knot. Using standard formulae, the comoving frame
cooling time at an observed frequency ν of a knot in equipartition is

$$t' = \frac{6\pi m_e c}{\sigma_T B_{\rm eq,0}^{2}}\left(\frac{m_e c^{2} B_{\rm eq,0}}{h B_{\rm cr}\,\nu}\right)^{1/2}\delta^{2} = t_0\,\delta^{2} \qquad (8)$$

where B_eq,0 is the equipartition magnetic field assuming δ = 1, m_e is the electron
mass, σ_T is the Thomson cross-section, h is Planck’s constant, B_cr = 2πm_e²c³/(eh) =
4.4 × 10¹³ G is the critical magnetic field and e is the electron charge. This
corresponds to a propagation length of l = cβΓt′ = cβΓδ²t_0, which in turn corresponds
to a projected cooling length l_proj = l sin θ = t_0 c δ β_app, where we have used the
standard expression β_app = β sin θ/(1 − β cos θ). Using values appropriate for knot B we
obtain l_proj = 874 pc and for knot C l_proj = 104 pc, in both cases longer than the
propagation length recorded by our observations (about 43 pc for knot B and about
11 pc for knot C).
Extended Data Figure 1 | Position versus year for knot A. Positions are taken
from the contour analysis method. Errors are 1σ from contour-derived position
measurement added to the systematic error on the mean position from the
image registration in quadrature.
Extended Data Figure 2 | Position versus year for knot D. Positions are
taken from the contour analysis method. Errors are 1σ from contour-derived
position measurement added to the systematic error on the mean position from
the image registration in quadrature.
Extended Data Figure 3 | Mean flux along the midline of the jet. Flux is
averaged transversely over 0.1” at each step along the jet. The epoch is denoted 
by arrows. Note the decay of knot A with time, while the knot B+C complex 
appears substantially brighter in the last epoch. 
Extended Data Figure 4 | Comparison of 1983 VLA midline flux to 1994
HST midline flux. Flux is averaged transversely over 0.1” at each step along the 
jet. We fit parabolic forms to the peaks seen in the radio contour to derive rough 
estimates of knot location, with values depicted with short dashed lines. Dotted 
lines connecting matched knots are shown to guide the eye. 
Extended Data Figure 5 | Comparison of 1983 radio positions of knots with
HST data. The locations of knots B and C in 1983 are denoted by triangles, and 
we include an estimated 15 mas error on the position based on a typical 
centroiding error of 10% of the beam size (0.15”). Optical data are identical to 
that described in Fig. 2. For knot C, a dashed line gives the fit to epochs 1983 and 
1994. For knot B, the dashed line is a parabolic fit to all data. The location of 
knot A is noted with a dotted line. 
Extended Data Figure 6 | Depiction of background subtraction method.
a, The final 2014 image stack before subtraction. b, The mask used in modelling
the galaxy and dust disk. (White areas were not used in fits.) c, The final 2014
galaxy and dust model. d, The same image as a after c has been subtracted.
Physical scaling of all images and flux scaling of a, b and d are identical.
Extended Data Figure 7 | Depiction of modelling results. a, The resulting
background-subtracted model image for the 2002 epoch. b, The real
background-subtracted 2002 image. In both cases the image box is shown by the
red outline. Only pixels within this area were used in the fit.
Extended Data Table 1 | Maximum-likelihood modelling results

Name     Year  Radial position  Transverse position  Intensity     Radius       Radial position*  Total flux  Size
               (pixels)         (pixels)             (μJy/pixel)   (pixels)     (mas)             (μJy)       (pc)
Knot A†  1994   5.45 ± 0.13     ...                  0.65 ± 0.04   1.28 ± 0.04  136 ± 5           3.3 ± 0.3   28.3 ± 1.0
         1996   4.95 ± 0.12     ...                  0.43 ± 0.02   2.01 ± 0.05  124 ± 5           5.5 ± 0.3   44.4 ± 1.0
         2002   4.89 ± 0.09     ...                  0.24 ± 0.01   2.05 ± 0.04  122 ± 4           3.1 ± 0.1   45.3 ± 0.9
         2014   4.97 ± 0.12     ...                  0.47 ± 0.02   1.42 ± 0.03  124 ± 3           3.0 ± 0.1   31.4 ± 0.6
Knot B   1994  10.28 ± 0.15      0.45 ± 0.13         0.37 ± 0.02   1.76 ± 0.05  257 ± 5           3.6 ± 0.3   38.8 ± 1.2
         1996  11.26 ± 0.11     -0.32 ± 0.11         0.42 ± 0.02   1.62 ± 0.04  281 ± 5           3.5 ± 0.2   35.9 ± 1.0
         2002  12.11 ± 0.06      0.28 ± 0.05         0.41 ± 0.01   1.79 ± 0.02  303 ± 4           4.2 ± 0.1   39.6 ± 0.5
         2014  14.35 ± 0.07     -0.70 ± 0.05         0.42 ± 0.01   1.98 ± 0.02  359 ± 2           5.3 ± 0.1   43.9 ± 0.5
Knot C   1994  18.02 ± 0.19     -0.08 ± 0.15         0.24 ± 0.02   2.06 ± 0.07  450 ± 6           3.2 ± 0.3   45.6 ± 1.6
         1996  18.39 ± 0.11     -0.41 ± 0.09         0.36 ± 0.01   1.98 ± 0.04  460 ± 4           4.5 ± 0.2   43.7 ± 0.9
         2002  18.32 ± 0.07     -0.28 ± 0.06         0.27 ± 0.01   2.21 ± 0.03  458 ± 4           4.2 ± 0.1   48.9 ± 0.6
         2014  19.14 ± 0.10     -0.56 ± 0.07         0.28 ± 0.01   2.51 ± 0.03  478 ± 2           5.5 ± 0.1   55.4 ± 0.6

* Systematic error of 3.5 (0.18) mas for WFPC2 (ACS) images has been added to the measurement error from column 3.
† Knot A values are included for completeness; see Methods for note on reliability of knot A positions.
Extreme ultraviolet high-harmonic spectroscopy of solids

T. T. Luu¹*, M. Garg¹*, S. Yu. Kruchinin¹, A. Moulet¹, M. Th. Hassan¹ & E. Goulielmakis¹


Extreme ultraviolet (EUV) high-harmonic radiation emerging
from laser-driven atoms, molecules or plasmas underlies powerful
attosecond spectroscopy techniques and provides insight into
fundamental structural and dynamic properties of matter. The
advancement of these spectroscopy techniques to study strong-field
electron dynamics in condensed matter calls for the generation
and manipulation of EUV radiation in bulk solids, but this
capability has remained beyond the reach of optical sciences.
Recent experiments and theoretical predictions paved the
way to strong-field physics in solids by demonstrating the generation
and optical control of deep ultraviolet radiation in bulk
semiconductors, driven by femtosecond mid-infrared fields or the
coherent up-conversion of terahertz fields to multi-octave spectra
in the mid-infrared and optical frequencies. Here we demonstrate
that thin films of SiO₂ exposed to intense, few-cycle to sub-cycle
pulses give rise to wideband coherent EUV radiation extending in
energy to about 40 electronvolts. Our study indicates the association
of the emitted EUV radiation with intraband currents of
multi-petahertz frequency, induced in the lowest conduction band
of SiO₂. To demonstrate the applicability of high-harmonic
spectroscopy to solids, we exploit the EUV spectra to gain access to fine
details of the energy dispersion profile of the conduction band that
are as yet inaccessible by photoemission spectroscopy in
wide-bandgap dielectrics. In addition, we use the EUV spectra to trace
the attosecond control of the intraband electron motion induced
by synthesized optical transients. Our work advances lightwave
electronics in condensed matter into the realm of
multi-petahertz frequencies and their attosecond control, and marks
the advent of solid-state EUV photonics.

Coherent radiation, emerging from the nonlinear interaction of
intense lasers with matter, contains information about structure and
dynamics and has lent itself to a wide range of fundamental
spectroscopy techniques. In strong-field laser physics, acceleration of
electron wavepackets launched by the strong-field ionization of atoms
or molecules and their re-collision with their parent ions gives rise to
coherent attosecond EUV radiation. Such radiation can provide
detailed information about electronic structure, the atomic motion
in molecules, and the dynamics of free and bound electron
wavepackets in the generated ionic species. Here we demonstrate the
extension of strong-field EUV spectroscopy and photonics to the condensed
phase.

In our experiments, thin (~120 nm), free-standing, polycrystalline
SiO₂ films (see Supplementary Information section I) were exposed to
intense field-sculpted, ultrafast optical waveforms generated in a
second-generation light field synthesizer (refs 23 and 24 and M.Th.H.
et al., manuscript in preparation). In Fig. 1c we show spectra of the
radiation emerging from the bulk of the SiO₂ samples (Fig. 1b) when
driven by the field of a 1.5-cycle pulse (Fig. 1a; energy ~0.5 J) as a
function of the transmitted field amplitude, which was varied in the range
~0.7–1.1 V Å⁻¹ (in the bulk of SiO₂). Regularly spaced spectral peaks,
the energies of which match well the odd harmonics of the ultrashort
pulse carrier frequency, attest to the coherence between the driving
and emitted EUV fields.

A gradual increase in the field strength of the optical pulse gives rise
to the formation of a robust spectral plateau extending over more than
~20 eV (15–35 eV) in the EUV range. The cutoff energy, defined as the
last harmonic peak discernible at the high-energy end of the EUV
spectral plateau, undergoes a linear increase with optical field strength
(Fig. 1d). The intensity yield of the emitted harmonic radiation
exhibits a highly nonlinear dependence on optical bias, unique for each
individual harmonic peak (Fig. 1e–g). The slope of the intensity yield
declines at higher optical biasing of the medium (Fig. 1e), while for
some of the harmonic peaks, it rises again with further increase of the
field strength (Fig. 1f, g).

The linear scaling of the harmonic cutoff energy with the field
amplitude distinguishes the underlying mechanism of high-harmonic
radiation from that in a gas phase, agrees with the scaling observed in
previous studies of coherent, laser-driven dynamics in solids giving
rise to radiation in the terahertz, optical and vacuum ultraviolet
ranges, and dictates the need for theoretical modelling tailored to
describe the interaction of intense optical fields with bulk solids. To
gain insight into the physical dynamics underlying EUV emission in
our experiments, we conducted both semiclassical and
quantum-mechanical simulations under the precise experimental conditions
given above.

In the tight-binding model beyond the nearest-neighbour
approximation, the dependence of energy E_i(k) in the ith band on crystal
momentum k is represented by a series of spatial harmonics:

$$E_i(k) = \sum_{n=0}^{n_{max}} \epsilon_{i,n} \cos(nka) \qquad (1)$$

Here a is the lattice constant, n is the lattice site number (order of the
spatial harmonic), n_max is the maximum number of distant neighbours
included in the model, ε_{i,n} are Fourier coefficients, and ε_{i,0} is the band
offset (see Supplementary Information section II). The highest
characteristic distance in the crystal is, therefore, R_max = n_max a. Consequently, the
maximum energy that an electron wavepacket can acquire from the field
is E_max = eF_0 R_max = n_max ħω_B, where ω_B = eF_0 a/ħ is the Bloch frequency
and F_0 is the field amplitude.
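The relations above can be sketched numerically. The snippet below is only an illustration: the Fourier coefficients and the lattice constant are hypothetical placeholders, not values from the study, and the function names are chosen here for convenience. Working in eV, Å and V Å⁻¹ makes the product eF_0R numerically equal to F_0R in eV.

```python
import math

HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV*s


def band_energy(k, a, eps):
    """E(k) = sum_n eps[n] * cos(n*k*a); eps[0] is the band offset (eV)."""
    return sum(e_n * math.cos(n * k * a) for n, e_n in enumerate(eps))


def bloch_frequency(f0, a):
    """omega_B = e*F0*a/hbar in rad/s, with F0 in V/Angstrom and a in Angstrom."""
    return f0 * a / HBAR_EV_S


def max_energy(f0, a, n_max):
    """E_max = e*F0*R_max = n_max*hbar*omega_B, returned in eV."""
    return n_max * HBAR_EV_S * bloch_frequency(f0, a)


# Hypothetical inputs, chosen only for illustration.
eps = [3.0, -1.0, 0.2, -0.05]  # eV, so n_max = 3
a = 5.0                        # Angstrom

print(band_energy(0.0, a, eps))          # at k = 0 every cosine equals 1
print(max_energy(0.8, a, len(eps) - 1))  # largest energy gain at F0 = 0.8 V/Angstrom
```

Because E_max grows linearly with both F_0 and n_max, a cutoff that scales linearly with field amplitude is what this picture leads one to expect.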

To illustrate the role of the energy dispersion of the lowest
conduction band in the emission of EUV radiation, we first perform
semiclassical simulations of a pre-existing wavepacket driven by the electric
field in Fig. 1a, for F_0 = 0.8 V Å⁻¹, along the contour of different spatial
components, which are included in the Fourier series (equation (1)) of
the band dispersion (Fig. 2b); see Supplementary Information section
VIII for the case of F_0 = 1.1 V Å⁻¹. In our simulations, the amplitudes
ε_{c,n}, as well as the number of significant spatial harmonics, are derived
from the Fourier expansion of the band along the Γ–M direction in the
Brillouin zone (see Supplementary Information section II). The
simulations (Fig. 1d) highlight the gradual increase of the cutoff energy with
spatial harmonic order n and suggest that, under our experimental
conditions, deceleration of electrons by the spatial harmonics of orders
n = 5, 6 and 7 play a central role in the emission of radiation near the
cutoff energy.

Figure 1 | Generation, energy cutoff and intensity scaling of coherent EUV
radiation in SiO₂ films. a, The 1.5-cycle driving pulse (with spectral range
≈ 1.1–2.5 eV) produced in a light field synthesizer and characterized
by attosecond streaking, before being focused onto a thin film of SiO₂
(~120 nm) to generate EUV radiation (b). c, EUV spectra recorded for
increasing field strengths for which the screening factor (~0.8) derived
from the Fresnel formula is taken into account. The dashed portion of the
curves indicates the noise floor. d, Scaling of the cutoff energy E_cutoff (blue
dots with error bars) with the field amplitude F_0. Shown is linear fitting
of the measured data (blue line) and results of semiclassical simulations
(red line). e–g, Yield dependence for harmonics of order N = 11 (e), 13
(f) and 15 (g) on the field amplitude (blue dots). Red curves represent
fittings based on equation (3), including all spatial harmonics up to n_max.
Green curves in e, g represent the fitting of experimental data with
equation (3), where only one spatial harmonic (n = 5 and n = 6,
respectively) was taken into account. Error bars depict the standard
deviations of the values extracted from several data sets recorded under
identical conditions. Corresponding slopes of the intensity curves at 0.7 V
Å⁻¹ are evaluated as ~F, ~F,', and ~ F}°, respectively.

Figure 2 | Semiclassical picture of the field-driven electron dynamics in
SiO₂. a, An electron wavepacket driven by the field in the lowest conduction
band along the Γ–M direction of the Brillouin zone. b, Spatial harmonic
components of the band dispersion resulting from equation (15) in
Supplementary Information. Relative amplitudes are scaled, for clarity.
c, Contributions to the EUV spectra from each of the spatial harmonics for the
field of Fig. 1a at the amplitude F_0 = 0.8 V Å⁻¹. The total emitted spectrum
(black curve) aggregates contributions from all spatial harmonics. d, Simulated
spectra for the electric field waveform (Fig. 1a) for the realistic band shown in
a as a function of the peak field strength F_0. Colour-coding corresponds to field
strengths of Fig. 1c. e–g, Contributions of spatial harmonics n = 4, 5, 6 to the
intensity yields of EUV harmonics of orders N = 11 (e), 13 (f) and 15 (g) versus
field amplitude F_0.

From the linear fitting of experimental data (Fig. 1d, blue line), we
evaluate the largest characteristic distance significantly contributing to
the spectral intensity as R_cutoff = n_cutoff a = e⁻¹ dE_cutoff/dF_0, and obtain
R_cutoff ≈ 30 Å and n_cutoff = 6.2 ± 0.5. R_cutoff is smaller than the average
size of the crystalline domain (which is > 40 Å), and thus, scattering
at domain edges can be neglected (see Supplementary Information
section I). Extrapolation of the linear fitting to the zero field F_0 = 0
yields a cutoff energy E_cutoff = 0.6 ± 2.0 eV (see Supplementary
Information section III) and even further strengthens the link between
the emitted radiation and the induction of intraband currents
considered in the semiclassical treatment, where E_cutoff = 0 for F_0 = 0 is
predicted. When performed under the realistic energy dispersion
of the lowest conduction band E(k) along the Γ–M direction
(Fig. 2a), the semiclassical simulation yields a reasonable reproduction
(Fig. 2d) of the experimental spectra shown in Fig. 1c. The calculated
maximum cutoff energy E_cutoff ≈ 33 eV (within the same dynamic
range of our experiments), its linear scaling versus field amplitude F_0,
and n_cutoff = 6 evaluated from the slope of the cutoff dependence of the
simulated spectra (red line, Fig. 1d) are in excellent agreement with our
experimental findings (blue line, Fig. 1d). Simulations assuming that
the field polarization is parallel to the Γ–K or Γ–A directions of SiO₂
yielded EUV spectra of an order of magnitude weaker intensity and
lower cutoff energy in comparison to those along the Γ–M direction
(see Supplementary Information section II, Supplementary Fig. 2). The
contribution of the highest valence bands was also found to be
negligible (see Supplementary Information section II). The above
conclusions support the notion that, in our experiments, the EUV emission
primarily occurs due to acceleration of first conduction band electrons
in microcrystals, for which the Γ–M direction is parallel to the laser
polarization axis.
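As an illustration of how a characteristic distance follows from the slope of the cutoff energy versus field amplitude, the sketch below runs an ordinary least-squares line fit. The data points are synthetic, generated from a line with slope 30 Å and intercept 0.6 eV purely to exercise the arithmetic, and the lattice constant is a hypothetical placeholder. None of these numbers are the measured data.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic (not measured) points: field in V/Angstrom, cutoff energy in eV.
fields = [0.7, 0.8, 0.9, 1.0, 1.1]
cutoffs = [30.0 * f + 0.6 for f in fields]

# With energies in eV and fields in V/Angstrom, dE/dF divided by e is in Angstrom.
r_cutoff, e_zero_field = linear_fit(fields, cutoffs)

a_lattice = 4.9  # hypothetical lattice constant in Angstrom
n_cutoff = r_cutoff / a_lattice

print(r_cutoff, e_zero_field, n_cutoff)
```

The intercept of the fit plays the role of the extrapolated zero-field cutoff energy, which the semiclassical picture expects to vanish.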

In our quantum-mechanical simulations based on the numerical 
solution of the semiconductor Bloch equations, we find that the
cutoff energy and fine details of our experimental spectra are reas- 
onably reproduced by the intraband current only, but they disagree 
with those obtained from interband or total polarization (see 
Supplementary Information section IV, Supplementary Figs 8, 9). 
These results were further verified by contrasting the intensity depend- 
ence of the 11th harmonic in our experiment with that predicted by the 
semiclassical model and interband and intraband terms of quantum- 
mechanical simulations (see Supplementary Information section V). 

The previous comparison of simulations with measurements
pinpoints the importance of intraband dynamics in describing the process
and the capability of the semiclassical approach to capture key aspects
of the emission of EUV radiation, including the nonlinearity
pertaining to the intensity scaling. As a result, the characteristics of the EUV
spectra offer direct insight into the generation of multi-PHz
field-tunable intraband currents, whose cutoff frequency is adjustable from
ν_cutoff ≈ 3.6 PHz to 8.5 PHz (Fig. 1c). These are the highest frequency
currents so far induced in a solid medium by optical fields, and more
than an order of magnitude higher than those in previous experiments
in solid-state lightwave electronics.
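The quoted frequencies can be checked against the photon energies of the spectral plateau with ν = E/h. The short sketch below does this conversion; the function name is chosen here for illustration.

```python
PLANCK_EV_S = 4.135667696e-15  # Planck constant in eV*s


def ev_to_phz(energy_ev):
    """Convert a photon energy in eV to a frequency in PHz (1 PHz = 1e15 Hz)."""
    return energy_ev / PLANCK_EV_S / 1e15


# Plateau edges (15 and 35 eV) and the 40 eV extension mentioned in the text.
for energy in (15.0, 35.0, 40.0):
    print(energy, "eV ->", round(ev_to_phz(energy), 2), "PHz")
```

The 15 eV and 35 eV edges of the plateau map onto roughly 3.6 PHz and 8.5 PHz, and 40 eV onto about 9.7 PHz.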

The semiclassical theory of high-harmonic radiation predicts that
the intensity of the emitted spectra in the case of a cosine band exhibits
an extreme nonlinearity versus the field amplitude F_0. For an electric
field carried at a frequency ω_L, the intensity dependence of the Nth
order harmonic is proportional to

$$I_N \propto J_N^{2}\!\left(\frac{eF_0 a}{\hbar\omega_L}\right) \qquad (2)$$

where J_N(z) is the Bessel function of the first kind of order N.
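To give a feel for this nonlinearity, the sketch below evaluates a Bessel-type yield for a single cosine band. It assumes that the yield scales as J_N² of the ratio eF_0a/(ħω_L), with any constant prefactor dropped. The lattice constant and photon energy in the usage lines are hypothetical inputs, and the Bessel function is computed from Bessel's integral with the trapezoid rule rather than from a library.

```python
import math


def bessel_j(order, x, steps=2000):
    """J_order(x) for integer order via Bessel's integral
    (1/pi) * integral_0^pi cos(order*t - x*sin(t)) dt, trapezoid rule."""
    h = math.pi / steps
    # Endpoint values: cos(0) = 1 at t = 0 and cos(order*pi) at t = pi.
    total = 0.5 * (1.0 + math.cos(order * math.pi))
    for j in range(1, steps):
        t = j * h
        total += math.cos(order * t - x * math.sin(t))
    return total * h / math.pi


def harmonic_yield(order, f0, a, photon_ev):
    """Relative yield J_N(z)**2 with z = e*F0*a/(hbar*omega_L).
    f0 in V/Angstrom, a in Angstrom, photon_ev = hbar*omega_L in eV."""
    z = f0 * a / photon_ev
    return bessel_j(order, z) ** 2


# Hypothetical inputs: a = 4.9 Angstrom, carrier photon energy 1.96 eV.
for f0 in (0.4, 0.8, 1.1):
    print(f0, harmonic_yield(11, f0, 4.9, 1.96))
```

Doubling the field in this example raises the 11th-harmonic yield by many orders of magnitude, which is the steep dependence the text refers to.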

For a band whose dispersion profile comprises a few spatial har- 
monics, equation (2) can be generalized (see Supplementary 
Information section II) to account for the contributions of distant 
neighbours to the yield of the Nth harmonic peak as 
These equations are also accurate in the short pulse limit with the
difference that, in this case, F_0 refers to the peak field strength of the
pulse. Figure 2e–g illustrates how the representative spatial harmonics
(n = 4, 5, 6) influence the yield dependence of EUV harmonic peaks in
our spectral range versus F_0 according to equation (3). These plots
underpin the possibility of extracting structural information about the
underlying medium by fitting the yield of a single or several EUV
harmonic peaks by equation (3). Indeed, a simultaneous fitting for
harmonic orders 11 to 15 (see Fig. 1e–g, red curves) by equation (3)
yields an excellent regression and allows for retrieval of the amplitudes
ε_{c,n} in expansion (1).

Based on the confidence interval of the reconstruction, our fitting 
reveals that the intensity dependence of EUV harmonics in the spectral 
range of our study aggregates primarily the contributions of EUV 
photons emitted when the field-driven electron wavepacket is decel- 
erated by the spatial harmonics n = 5 and 6. For instance, the yield of 
EUV harmonic N = 11 is dominated by the deceleration of the
wavepacket by the spatial harmonic n = 5 (Fig. 1e, green curve), while for
N = 15, deceleration by n = 6 dominates (Fig. 1g, green curve); we
verify this by fitting their yield based on equation (3) for the corres- 
ponding spatial harmonics. This result is in excellent agreement with 
the independent conclusions of the cutoff analysis of Fig. 1d, which 
reveals similar sensitivity to spatial harmonics of the cutoff energy of 
the emitted radiation. 

The evaluated ratio of the significantly contributing coefficients is
ε_{c,5}/ε_{c,6} ≈ −5.1 ± 0.6 and accurately matches the theoretically
predicted ratio obtained in the expansion given by equation (15) in the
Supplementary Information, ε_{c,5}/ε_{c,6} ≈ −5.6. This result offers
additional evidence that the laser-driven intraband current accurately
captures the EUV emission and, most importantly, constitutes the first
measurement of the fine details of the conduction band dispersion in
wide bandgap materials. Indeed, the large insulating bandgap and low
mobility of photo-excited carriers prevent replenishing of the created
vacancies and give rise to charging effects, which in turn make
photoemission spectroscopies impractical and limit their reach to valence or
low-lying conduction bands. As a pure polarization spectroscopy,
our approach overcomes this essential frontier and highlights the
promise for a new range of studies in dielectric materials. The
resolution pertinent to our technique can be estimated from the highest
spatial harmonic order influencing the EUV emission as 1/6a ~ 0.1
Å⁻¹ (for n_cutoff = 6), which is close to that of modern angular resolved
photoemission spectroscopy.

In a second set of experiments, we dynamically probe the physics 
underlying EUV generation in the condensed phase and explore routes 
towards the ultrafast control of multi-PHz-scale currents by conduct- 
ing measurements of EUV spectra generated by various waveforms 
including few-cycle, 1.5-cycle, sub-cycle and half-cycle pulses. Spectra 
recorded for two settings of the carrier envelope phase (CEP) of the 
above driving fields are shown in Fig. 3. The increase of the averaged 
inter-peak energy spacing (Fig. 3a to Fig. 3b) and the broadening of 
these harmonic peaks (Fig. 3c) are compatible with increasing driver 
frequencies and decreasing pulse duration of the optical field, respect- 
ively, and further corroborate the coherent relation between the emit- 
ted and driving fields. In contrast to few-cycle and 1.5-cycle driving 
fields (Fig. 3a, b), sub-cycle and, even more evidently, half-cycle field 
waveforms give rise to EUV continua (Fig. 3c, d), whose amplitude and 
shape show a considerable sensitivity to the CEP adjustments. 

The dynamic control of the emission attained by CEP manipulation 
of the waveform of the driver pulses lends itself to use as an additional 
platform to verify the validity of the conclusions reached in the cutoff 
and intensity analysis (Fig. 1d–g), by further contrasting our findings
with quantum-mechanical and semiclassical models. Figure 3e, f, k, l
shows simulated temporal dynamics of an electron wavepacket in the 
conduction band of SiO, for two representative waveforms of the few- 
cycle and half-cycle driving fields, and for the phase settings of the 
insets of Fig. 3a, d. Interband dynamics give rise to periodic modula- 
tions of the amplitude of the wavepacket as a function of time, whereas 
the intraband dynamics are best manifested by the lateral oscillatory 
displacement of the excited wavepacket. Figure 3g, m depicts the simu- 
lated temporal intensity profiles of the radiation induced by intraband 
currents, spectrally filtered in the energy range of our experiments 
(>15 eV); Fig. 3h, n shows the results of semiclassical simulations; 
and Fig. 3i, j, o, p show the corresponding spectra (see Supplementary
Information section IV for additional information). Importantly and
in support of the conclusions of the intensity scaling analysis, the
results of the semiclassical simulations for the entire range of pulse
waveforms exploited are virtually identical, if we consider the band
dispersion with the terms ~cos(5ka) − 0.19cos(6ka), which represent
significantly contributing spatial harmonics and their experimentally
retrieved relative amplitudes (Fig. 1d–g).

Figure 3 | Field control of laser-driven EUV emission in SiO₂ (experiment
and simulation). a–d, EUV harmonic spectra recorded for two CEP settings
(φ_CE ≈ 0, blue; φ_CE ≈ π/2 rad, magenta) of the driving waveforms shown in
insets: a, few-cycle (spectral range ≈ 1.1–1.8 eV and carrier frequency
ω_L ≈ 1.52 eV), b, 1.5-cycle (≈ 1.1–2.5 eV, ω_L = 1.96 eV), c, sub-cycle
(≈ 1.1–3.1 eV, ω_L ≈ 2.227 eV), and d, half-cycle (≈ 1.1–4.6 eV,
ω_L ≈ 2.1 eV) pulses. The waveforms are sampled by attosecond streaking.
e, f, Population dynamics in the lowest conduction band of SiO₂ along the
Γ–M direction driven by the few-cycle pulse with φ_CE ≈ 0 and
φ_CE ≈ π/2 rad, respectively (insets of a). g, h, Temporal intensity
profiles I(t) ∝ |∂J(t)/∂t|² of the radiated EUV fields in the spectral
region > 15 eV for a few-cycle pulse, calculated with quantum-mechanical
(QM) and semiclassical (SC) models. J(t) denotes the intraband current.
i, Corresponding spectra obtained from quantum-mechanical and
j, semiclassical simulations for a few-cycle pulse for the above-mentioned CEP
settings are marked by the same colour code. k–p, Same as e–j but calculated for
a half-cycle pulse (insets of d).

The reasonable reproduction of the experimental spectra for the
broad range of waveforms used (see also Supplementary
Information section IV) by both the semiclassically and
quantum-mechanically simulated intraband currents allows the linking of the
temporal electron dynamics underlying our simulations with those in
our experiments. For few-cycle pulses (Fig. 3g), excitation of the
currents and concomitant radiation extend temporally over several field
cycles. The current profile (Fig. 3g, h, red and blue curves) in this case is
temporally displaced by a CEP variation of the optical driving field, but
its structure remains virtually unaffected, resulting in nearly invariable
spectra (Fig. 3i, j) with respect to the variation of this pulse parameter,
in agreement with our experiments (Fig. 3a). By contrast, half-cycle
pulses confine and control electric currents to the optical cycle
(Fig. 3m, n) with CEP. This control is manifested in the spectral
domain by wideband modulations of the broadband emitted spectra
(Fig. 3o, p) and their continuum shape to give structured ones in
accordance with our experiments (Fig. 3d). Notably, the semiclassical
simulation (Fig. 3j, p) seems to reproduce this trend better than the
quantum-mechanical one (Fig. 3i, o). This can be attributed to the fact
that intraband current and interband polarization are inherently
coupled in the semiconductor Bloch equations.

The CEP-based manipulation of the emitted spectra emanating
from an extreme nonlinear process has served for years as a paradigm
of sub-cycle confinement and control in attosecond physics. Our
experiments and simulations support the notion that this paradigm is 
also applicable for strong-field-driven electron dynamics and EUV 
emission in bulk solids. As half-cycle drivers are confined to a de facto 
sub-femtosecond interval (duration t, < 1 fs), these broadband spec- 
tral modulations offer evidence of the attosecond control of the 
induced multi-PHz intraband currents, and represent a unique and 
generic tool for tracing coherent strong-field dynamics in solids. 
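The semiclassical intraband picture discussed above can be illustrated with a deliberately simplified sketch: a monochromatic field rather than a synthesized few-cycle or half-cycle waveform, and a two-term band ~cos(5ka) − 0.19cos(6ka) as quoted in the text. Under the acceleration theorem the crystal momentum follows k(t)a = −z sin(ω_L t) with z = eF_0a/(ħω_L), and the band velocity is proportional to dE/dk. The value z = 2.0 and all function names are assumptions made for this example, and constant prefactors are dropped.

```python
import cmath
import math


def harmonic_amplitudes(coeffs, z, n_harm, steps=1024):
    """Magnitudes of the Fourier components (harmonic orders 0..n_harm) of the
    band velocity v(t) ~ -sum_n n*eps_n*sin(n*k(t)*a), sampled over one optical
    period, where k(t)*a = -z*sin(phase) for a monochromatic field."""
    samples = []
    for j in range(steps):
        phase = 2.0 * math.pi * j / steps
        ka = -z * math.sin(phase)
        samples.append(-sum(n * eps * math.sin(n * ka) for n, eps in coeffs.items()))
    amps = []
    for m in range(n_harm + 1):
        acc = sum(v * cmath.exp(-2j * math.pi * m * j / steps)
                  for j, v in enumerate(samples))
        amps.append(abs(acc) / steps)
    return amps


# Relative amplitudes of spatial harmonics n = 5 and n = 6, as quoted in the text.
band = {5: 1.0, 6: -0.19}
amps = harmonic_amplitudes(band, z=2.0, n_harm=20)  # z = 2.0 is a hypothetical value
for order in range(9, 18, 2):
    print(order, amps[order])
```

For a band that is symmetric in k, only odd harmonics of the driving frequency survive in this model, which matches the odd-harmonic peak structure reported for the multi-cycle drivers.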

Further study of the coherence of the emitted EUV radiation
included the measurement of the far-field spatial properties of the
emerging radiation. The measurements reveal a well-behaved (ellipticity
~0.93) EUV beam profile (Fig. 4a), whose dimensions and shape
are in excellent agreement with those predicted by our semiclassical
simulations and free-space propagation (Fig. 4b, blue curve), and
suggest the full spatial coherence of the emitted EUV (see Supplementary
Information section VI).

Figure 4 | Coherent EUV source characteristics. a, Far-field spatial intensity
profile of the EUV beam sampled by a detector placed at a distance of ~90 cm
downstream from the source. b, x-axis profile (red curve) of a, and far-field
EUV beam profile (blue curve) calculated with semiclassical simulations and
free-space propagation. Error bars depict the standard deviations of the values
extracted from several data sets recorded under identical conditions. c, EUV
photon yield (1.5-cycle pulse) integrated over the range (15–40 eV) for SiO₂
(blue curve) and for the noble gases Ar (red curve) and Kr (green curve). d, EUV
supercontinua in SiO₂ extending to ~40 eV driven by half-cycle pulses at
different peak field strengths (given on curves).

To gain insights into the conversion efficiency of laser radiation to 
extreme ultraviolet, we compared the yield of the EUV emission in
SiO₂ with that of two broadly used media in gas-phase attosecond
physics, Kr and Ar (Fig. 4c), under identical conditions (see 
Supplementary Information section VII). Our study reveals that the 
yield of EUV emission from SiO₂, which is approximately 8 times
higher than that of these gases in the range of applied optical fields 
(Fig. 1c), falls within a similar efficiency class (10 7-10 °) as that of 
gases”. 

Extension of the EUV continua to higher photon energies (up to
40 eV or ν ~ 9.7 PHz) is also possible by applying a stronger field
(~1.36 V Å⁻¹) to our samples (Fig. 4d). For an even stronger optical
field, the ultrathin samples show degradation due to physical melting.
However, tests on thicker samples showed suppression of melting 
effects and may soon enable advancement of the coherent emission 
towards the soft X-ray regime, catalysing further developments. 

The generation of broadband EUV radiation by driving the coher- 
ent nonlinear motion of electrons in bulk solids advances attosecond 
high-harmonic spectroscopy to the condensed phase, and promotes 
solid-state laser photonics and electronics to a new frequency regime 
with great potential for novel scientific and technological inquiry. It 
offers new possibilities for the tracing and attosecond control of 
strong-field electron dynamics in solids. 
Robots have transformed many industries, most notably
manufacturing, and have the power to deliver tremendous benefits to
society, such as in search and rescue, disaster response, health care
and transportation. They are also invaluable tools for scientific
exploration in environments inaccessible to humans, from distant
planets to deep oceans. A major obstacle to their widespread
adoption in more complex environments outside factories is their
fragility. Whereas animals can quickly adapt to injuries, current
robots cannot ‘think outside the box’ to find a compensatory
behaviour when they are damaged: they are limited to their
pre-specified self-sensing abilities, can diagnose only anticipated
failure modes, and require a pre-programmed contingency plan for
every type of potential damage, an impracticality for complex
robots. A promising approach to reducing robot fragility
involves having robots learn appropriate behaviours in response
to damage, but current techniques are slow even with small,
constrained search spaces. Here we introduce an intelligent
trial-and-error algorithm that allows robots to adapt to damage in less
than two minutes in large search spaces without requiring
self-diagnosis or pre-specified contingency plans. Before the robot is
deployed, it uses a novel technique to create a detailed map of the
space of high-performing behaviours. This map represents the
robot’s prior knowledge about what behaviours it can perform
and their value. When the robot is damaged, it uses this prior
knowledge to guide a trial-and-error learning algorithm that
conducts intelligent experiments to rapidly discover a behaviour that
compensates for the damage. Experiments reveal successful
adaptations for a legged robot injured in five different ways, including
damaged, broken, and missing legs, and for a robotic arm with
joints broken in 14 different ways. This new algorithm will enable
more robust, effective, autonomous robots, and may shed light on
the principles that animals use to adapt to injury.

Current damage recovery in deployed robots typically involves two
phases: self-diagnosis, followed by selection of the best pre-designed
contingency plan. Such self-diagnosing robots are expensive,
because self-monitoring sensors are expensive, and are difficult to
design, because robot engineers cannot foresee every possible
situation: this approach often fails either because the diagnosis is
incorrect or because an appropriate contingency plan is not provided.

Injured animals respond differently (see Fig. 1a): they learn by trial
and error how to compensate for damage (for example, learning which
limp minimizes pain). Similarly, trial-and-error learning algorithms
could allow robots to creatively discover compensatory behaviours
without being limited to their designers’ assumptions about
how damage may occur and how to compensate for each type of
damage. However, state-of-the-art learning algorithms are impractical
because of the “curse of dimensionality”: the fastest algorithms, from
fields like reinforcement learning and modular robotics, depend
on human demonstrations or take 15 min or more even for relatively
small search spaces (for example, 6-12 parameters, requiring 15-30
min). Algorithms without these limitations take several hours.
Damage recovery would be much more practical and effective if robots
adapted as creatively and quickly as animals do (for example, in less
than 2 min) in larger search spaces and without expensive,
self-diagnosing sensors.

Here we show that rapid adaptation can be achieved by guiding an
intelligent trial-and-error learning algorithm with an automatically
generated, pre-computed behaviour-performance map that predicts
the performance of thousands of different behaviours (Supplementary
Video 1). Current learning algorithms start either with no knowledge
of the search space or with minimal knowledge from a few human
demonstrations. Our hypothesis is that animals understand the
space of possible behaviours and their value from previous
experience, and that animals adapt by intelligently selecting tests that
validate or invalidate whole families of promising compensatory
behaviours. The key insight here is that robots could do the same.

Our robots store knowledge from previous experience in the form of 
a map of the behaviour-performance space. Guided by this map, a 
damaged robot tries different types of behaviours that are predicted to 
perform well and, as tests are conducted, updates its estimates of the 
performance of those types of behaviours. The process ends when the 
robot predicts that the most effective behaviour has already been dis- 
covered. A key assumption is that information about many different 
behaviours of the undamaged robot will still be useful after damage, 
because some of these behaviours will still be functional despite the 
damage. The results of our experiments support this assumption for all 
the types of damage we tested, revealing that a robot can quickly discover 
a way to compensate for damage (for example, see Fig. 1c) without a 
detailed mechanistic understanding of its cause, as occurs with animals. 
We call this approach ‘Intelligent Trial and Error’ (IT&E) (see Fig. 1d). 

The behaviour-performance map is created using a novel algorithm
and a simulation of the robot, which can be either a standard physics
simulator or automatically discovered. The robot’s designers need
only describe the dimensions of the space of possible behaviours and a 
performance measure. For instance, walking gaits could be described 
by how much each leg touches the ground (a behavioural measure) and 
speed (a performance measure). An alternative gait behavioural mea- 
sure could be the percentage of time a robot’s torso has positive pitch, 
roll, and yaw angles. For grasping, performance could be measured by 
the amount of surface contact, and it has been demonstrated that 90% 
of effective poses for the 21-degree-of-freedom human hand can be 
captured by a three-dimensional behavioural space describing the 
principal components of ways in which hand-poses commonly vary.
To fill in the behaviour-performance map, an optimization algorithm 
simultaneously searches for a high-performing solution at each 
point in the behavioural space (Fig. 2a, b and Extended Data Fig. 1). 
This step requires simulating millions of behaviours, but needs 
to be performed only once per robot design before deployment 
(Supplementary Methods). 
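
The map-filling step can be illustrated with a small grid-based search in the spirit of MAP-Elites, the algorithm that Extended Data Fig. 2 names as the one IT&E uses to create the map. The Python sketch below is an illustration under stated assumptions, not the authors' implementation: the toy `simulate` function, the two-dimensional descriptor, the mutation scheme and every parameter value are hypothetical stand-ins for the physics simulation and the six-dimensional descriptor used in the paper.

```python
import random

def simulate(params):
    """Hypothetical stand-in for the physics simulator.

    Returns (descriptor, performance): the descriptor is a point in a toy
    2-D behaviour space in [0, 1]^2, the performance is a scalar to maximize.
    """
    half = len(params) // 2
    descriptor = (sum(params[:half]) / half,
                  sum(params[half:]) / (len(params) - half))
    performance = -sum((p - 0.5) ** 2 for p in params)
    return descriptor, performance

def cell_of(descriptor, bins):
    """Discretize a descriptor in [0, 1]^d into a grid cell index."""
    return tuple(min(int(x * bins), bins - 1) for x in descriptor)

def build_map(n_params=6, bins=5, n_init=100, n_iter=5000, sigma=0.1, seed=0):
    """Keep the best-performing parameter vector found for each grid cell."""
    rng = random.Random(seed)
    archive = {}  # cell -> (performance, params)

    def try_insert(params):
        descriptor, perf = simulate(params)
        cell = cell_of(descriptor, bins)
        # A candidate enters the map if its cell is empty or it beats the occupant.
        if cell not in archive or perf > archive[cell][0]:
            archive[cell] = (perf, params)

    # Initialize with random parameter vectors.
    for _ in range(n_init):
        try_insert([rng.random() for _ in range(n_params)])
    # Repeatedly mutate a randomly chosen stored solution.
    for _ in range(n_iter):
        _, parent = rng.choice(list(archive.values()))
        child = [min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in parent]
        try_insert(child)
    return archive

behaviour_map = build_map()
print(len(behaviour_map), "cells filled")
```

The design point this is meant to show is that the search does not look for one global optimum: it keeps one high-performing solution per region of the behaviour space, which is what later gives the damaged robot a diverse set of alternatives to test.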

A low confidence is assigned to the predicted performance of
behaviours stored in this behaviour-performance map because they have
not been tried in reality (Fig. 2b and Extended Data Fig. 1). During the
robot’s mission, if performance drops below a user-defined threshold
(owing either to damage or to a different environment), the robot
selects the most promising behaviour from the behaviour-performance
map, tests it, and measures its performance. The robot subsequently
updates its prediction for that behaviour and nearby behaviours,
assigns high confidence to these predictions (Fig. 2c, d and Extended
Data Fig. 1), and continues the select-test-update process until it
finds a satisfactory compensatory behaviour (Fig. 2e and Extended
Data Fig. 1).

Figure 1 | Using the Intelligent Trial and Error (IT&E) algorithm, robots,
like animals, can quickly adapt to recover from damage. a, Most animals
can find a compensatory behaviour after an injury. Without relying on
predefined compensatory behaviours, they learn how to avoid behaviours that
are painful or no longer effective. Photo credit: Michael Lloyd/The Oregonian
(2012). b, An undamaged, hexapod robot. RGB-D stands for red, green,
blue, depth. c, One type of damage the hexapod may have to cope with
(broken leg). d, After damage occurs, in this case making the robot unable
to walk fast and in a straight line, damage recovery via IT&E begins. The robot
tests different types of behaviours from an automatically generated map of
the behaviour-performance space. After each test, the robot updates its
predictions of which behaviours will perform well despite the damage. In this
way, the robot rapidly discovers an effective compensatory behaviour.

All of these ideas are technically captured via a Gaussian process
model, which approximates the performance function with already
acquired data, and a Bayesian optimization procedure, which
exploits this model to search for the maximum of the performance
function (see Supplementary Methods). The robot selects which
behaviours to test by maximizing an information acquisition function that
balances exploration (selecting points whose performance is
uncertain) and exploitation (selecting points whose performance is expected
to be high) (see Supplementary Methods). The selected behaviour is
tested on the physical robot and the actual performance is recorded.
The algorithm updates the expected performance of the tested
behaviour and lowers its uncertainty. These updates are propagated to
neighbouring solutions in the behavioural space by updating the
Gaussian process (Supplementary Methods). These updated
performance and confidence distributions affect which behaviour is tested
next. This select-test-update loop repeats until the robot finds a
behaviour whose measured performance is greater than some user-defined
percentage (here, 90%) of the best performance predicted for any
behaviour in the behaviour-performance map (Supplementary
Methods).

We first tested our algorithm on a hexapod robot that needs to walk
as fast as possible (Fig. 1b, d). The robot has 18 motors, an onboard
computer, and a depth camera that allows the robot to estimate its
walking speed (Supplementary Methods). The gait is parameterized by
36 real-valued parameters (Supplementary Methods) that describe, for
each joint, the amplitude of oscillation, phase shift, and duty cycle (the
fraction of the period that the joint angle is on one side of the midline).
The behaviour space is six-dimensional, where each dimension is the
proportion of time the ith leg spends in contact with the ground (that
is, the duty factor) (Supplementary Methods).
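
The select-test-update loop described above (a Gaussian process model, an acquisition function that trades off exploration and exploitation, and a 90% stopping rule) can be sketched in a few dozen lines. This is a minimal illustration under assumptions, not the authors' code: the squared-exponential kernel, its length scale, the upper-confidence-bound style acquisition with weight `kappa`, the unit prior variance, and the toy one-dimensional behaviour space with its damage model are all hypothetical choices made for the example.

```python
import math

def kernel(a, b, length=0.3):
    """Squared-exponential kernel; kernel choice and length scale are assumptions."""
    return math.exp(-sum((p - q) ** 2 for p, q in zip(a, b)) / (2 * length ** 2))

def solve(A, b):
    """Solve the small linear system A x = b by Gaussian elimination."""
    n = len(b)
    M = [A[i][:] + [b[i]] for i in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[pivot] = M[pivot], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            M[r] = [mr - f * mc for mr, mc in zip(M[r], M[c])]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def predict(prior, tested, noise=1e-3):
    """Posterior mean and variance for every behaviour, with the map as prior mean."""
    if not tested:
        return {x: (m, 1.0) for x, m in prior.items()}
    xs = list(tested)
    K = [[kernel(a, b) + (noise if i == j else 0.0) for j, b in enumerate(xs)]
         for i, a in enumerate(xs)]
    alpha = solve(K, [tested[a] - prior[a] for a in xs])
    out = {}
    for x, m in prior.items():
        k = [kernel(x, a) for a in xs]
        mean = m + sum(ki * ai for ki, ai in zip(k, alpha))
        var = max(0.0, 1.0 - sum(ki * bi for ki, bi in zip(k, solve(K, k))))
        out[x] = (mean, var)
    return out

def adapt(prior, test_on_robot, kappa=0.05, stop_ratio=0.9, max_trials=20):
    """Select-test-update loop with a 90%-of-best-prediction stopping rule."""
    tested = {}
    while len(tested) < min(max_trials, len(prior)):
        post = predict(prior, tested)
        # Select: highest predicted performance plus an uncertainty bonus.
        candidates = [x for x in prior if x not in tested]
        x_next = max(candidates,
                     key=lambda x: post[x][0] + kappa * math.sqrt(post[x][1]))
        # Test on the (damaged) robot, then update the predictions.
        tested[x_next] = test_on_robot(x_next)
        best_predicted = max(mean for mean, _ in predict(prior, tested).values())
        best_tested = max(tested, key=tested.get)
        if tested[best_tested] >= stop_ratio * best_predicted:
            break
    return max(tested, key=tested.get), tested

# Toy 1-D behaviour space: the simulation predicts faster gaits for larger x,
# but the hypothetical damage cripples every behaviour with x > 0.5.
grid = [(i / 10,) for i in range(11)]
prior = {x: 0.2 + 0.2 * x[0] for x in grid}

def damaged_robot(x):
    return prior[x] * (0.2 if x[0] > 0.5 else 0.9)

best, trials = adapt(prior, damaged_robot)
print("chosen behaviour:", best, "after", len(trials), "trials")
```

Note how the stopping threshold is tied to the best current prediction rather than to a fixed speed: as disappointing tests pull the predictions down, the threshold falls with them, which is the behaviour described for the orange dashed line in Extended Data Fig. 1.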

The behaviour-performance map created contains approximately 
13,000 different gaits (Supplementary Video 2 shows examples). We 
tested our robot under six different conditions: undamaged (C1 in 
Fig. 3a), four different structural failures (C2-C5 in Fig. 3a), and a 
temporary leg repair (C6 in Fig. 3a). We compare the walking speed of 
resultant gaits with a widely used, classic, hand-designed tripod gait
(Supplementary Methods). For each of the six damage conditions, we
ran our adaptation step five times for each of eight independently
generated behaviour-performance maps (with the default ‘duty factor’
behavioural description), leading to 6 × 5 × 8 = 240 experiments in
total. We also ran our adaptation step five times on eight
independently generated behaviour-performance maps defined by an
alternative behavioural description (‘body orientation’, see
Supplementary Methods) on two damage conditions (Fig. 3b, c),
leading to 2 × 5 × 8 = 80 additional experiments.

When the robot is undamaged (C1 in Fig. 3a), our approach yields
dynamic gaits that are 30% faster than the classic reference gait
(Fig. 3b, median 0.32 m s⁻¹, 5th and 95th percentiles [0.26 m s⁻¹;
0.36 m s⁻¹] versus 0.24 m s⁻¹), suggesting that IT&E is a good search
algorithm for automatically producing successful robot behaviours,
putting aside damage recovery. In all the damage scenarios, the
reference gait is no longer effective (around 0.04 m s⁻¹ for the four
damage conditions, C2-C5 in Fig. 3b). After IT&E, the compensatory
gaits achieve a reasonably fast speed (>0.15 m s⁻¹) and are between
three and seven times more efficient than the reference gait for that
damage condition (speeds given in metres per second: 0.24 [0.18; 0.31]
versus 0.04 for C2; 0.22 [0.18; 0.26] versus 0.03 for C3; 0.21 [0.17; 0.26]
versus 0.04 for C4; 0.17 [0.12; 0.24] versus 0.05 for C5; and 0.3 [0.21;
0.33] versus 0.12 for C6).
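
As a quick arithmetic check on these figures, the following Python sketch computes the ratio of the median IT&E speed to the reference-gait speed for each condition. It uses only the medians quoted in this paragraph; the variable names are illustrative.

```python
# Median walking speeds in metres per second quoted in the text:
# condition -> (IT&E median, reference gait).
speeds = {
    "C1": (0.32, 0.24),
    "C2": (0.24, 0.04),
    "C3": (0.22, 0.03),
    "C4": (0.21, 0.04),
    "C5": (0.17, 0.05),
    "C6": (0.30, 0.12),
}

# Speed-up of the adapted gait relative to the reference gait.
speedup = {cond: ite / ref for cond, (ite, ref) in speeds.items()}

for cond, ratio in speedup.items():
    print(f"{cond}: {ratio:.1f}x the reference gait")
```

With these medians, the ratios for the structural failures C2-C5 range from about 3.4 to about 7.3, consistent with the stated range of three to seven times, and the ratio for the undamaged robot is about 1.3.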

These experiments demonstrate that IT&E allows the robot both to
initially learn fast gaits and to recover reliably after physical damage.
Additional experiments reveal that IT&E acquires these capabilities
substantially faster than state-of-the-art algorithms (Bayesian
optimization and policy gradient, Extended Data Fig. 2), and that IT&E
can help with another major challenge in robotics: adapting to new
environments (such as differently sloped terrain, Extended Data Fig. 3).
On the undamaged or repaired robot (C6 in Fig. 3a), IT&E learns a walking
gait in less than 30 s (Fig. 3c, the undamaged robot takes 24 s [16 s; 41 s]
in 3 [2; 5] physical trials and the repaired robot takes 29 s [16 s; 82 s] in
3.5 [2; 10] trials). For the five damage scenarios, the robot adapts in
approximately one minute (66 s [24 s; 134 s] in 8 [3; 16] trials). It is
possible that for certain types of damage the prior information from
the undamaged robot does not help, and could even hurt, in learning a
compensatory behaviour (for example, if the map does not contain a
compensatory behaviour). We did not find such a case in our
experiments with the hexapod robot, but we did find a case on another robot
in which the prior information provided little benefit (experiment 1 in
the Supplementary Information).

Figure 2 | The two steps of IT&E. a, b, Creating the behaviour-performance
map. A user reduces a high-dimensional search space to a low-dimensional
behaviour space by defining dimensions along which behaviours vary. In
simulation, the high-dimensional space is then automatically searched to find a
high-performing behaviour at each point in the low-dimensional behaviour
space, creating a behaviour-performance map of the performance potential of
each location in the low-dimensional space. In our hexapod robot experiments,
the behaviour space is six-dimensional, with each dimension representing the
portion of time that one leg is in contact with the ground. The confidence
regarding the accuracy of the predicted performance for each behaviour in the
behaviour-performance map is initially low because no tests on the physical
robot have been conducted. c-e, Adaptation step: after damage, the robot
selects a promising behaviour, tests it, updates the predicted performance of
that behaviour in the behaviour-performance map, and allocates a high
confidence to this performance prediction. The predicted performances of
nearby behaviours (and confidence in those predictions) are likely to be
similar to the tested behaviour and are thus updated accordingly. This
select-test-update loop is repeated until a tested behaviour on the physical
robot performs better than 90% of the best predicted performance in the
behaviour-performance map, a value that can decrease with each test
(Extended Data Fig. 1). The algorithm that selects which behaviour to test
next balances between choosing the behaviour with the highest predicted
performance and behaviours that are different from those tested so far.
Overall, the IT&E approach presented here rapidly locates which types of
behaviours are least affected by the damage to find an effective,
compensatory behaviour.

Our results are qualitatively unchanged when using different
behavioural characterizations, including randomly choosing six descriptors
among 63 possibilities (Fig. 3b, c and Extended Data Fig. 4). Additional
experiments show that reducing the high-dimensional parameter
space to a low-dimensional behaviour space via the
behaviour-performance map is the key component for IT&E: standard Bayesian
optimization in the original parameter space does not find working
controllers (gaits) (Extended Data Fig. 2). We investigated how the
behaviour-performance map is updated when the robot loses a leg (C4
in Fig. 3a). Initially, the map predicts large areas of high performance.
During adaptation, these areas disappear because the behaviours do
not work well on the damaged robot. IT&E quickly identifies one of the
few remaining high-performance behaviours (Fig. 4 and Extended
Data Figs 5 and 6).

The same damage recovery approach can be applied to any robot, 
such as a robotic arm. We tested 14 different damage conditions with a 
planar, eight-jointed robotic arm (Fig. 3d-f and Extended Data Fig. 7).
The behaviour-performance map’s behavioural dimensions are the x, 
y position of the hand at the end of the arm. To show that the map- 
generating performance measure can be different from the ultimate 
performance measure, and to encourage smooth arm movements, the 
performance measure during map generation rewards low variance
of the eight specified motor angles (Supplementary Methods).
During adaptation, performance is measured as distance to the target. 
As with the hexapod robot, our approach discovers a compensatory 
behaviour in less than 2 min, usually in less than 30 s, and with fewer 
than ten trials (Fig. 3f and Extended Data Fig. 7). 
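
The two measures used for the arm can be written down directly: the behaviour descriptor is the hand position, the map-generation performance favours low variance of the joint angles, and the adaptation-time performance is the distance to the target. The Python sketch below is illustrative only; equal link lengths, angles measured relative to the previous link, and the `link_length` value are assumptions and are not taken from the paper.

```python
import math
from statistics import pvariance

def hand_position(joint_angles, link_length=0.125):
    """Forward kinematics of a planar arm; each angle is relative to the previous link."""
    x = y = heading = 0.0
    for angle in joint_angles:
        heading += angle
        x += link_length * math.cos(heading)
        y += link_length * math.sin(heading)
    return x, y

def map_performance(joint_angles):
    """Map-generation performance: negated variance of the joint angles (radians squared)."""
    return -pvariance(joint_angles)

def target_error(joint_angles, target):
    """Adaptation-time measure: distance from the hand to the target."""
    x, y = hand_position(joint_angles)
    return math.hypot(x - target[0], y - target[1])

straight = [0.0] * 8          # all joints aligned
zigzag = [0.4, -0.4] * 4      # alternating joint angles

print(hand_position(straight), map_performance(straight))
print(hand_position(zigzag), map_performance(zigzag))
```

Under this measure a zigzag configuration scores lower than a straighter one, matching the description in Extended Data Fig. 7 that jagged ways of reaching a point are penalized.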

Although animals do not use the specific algorithm we present,
there are parallels between IT&E and animal learning. Damage
recovery in animals may occur without learning (for instance, due to
the built-in robustness of evolved control loops), but if such
pre-programmed robustness fails, many animals turn to learning. As in
our robot, such learning probably exploits an animal’s intuitions about
how its intact body works to experiment with different behaviours to
find what works best. Also like animals, IT&E allows the quick
identification of working behaviours with a few, diverse tests instead
of trying behaviours at random or trying small modifications to the
best behaviour found so far. Additionally, the Bayesian optimization
procedure followed by our robot appears similar to the technique
employed by humans when they optimize an unknown function,
and there is strong evidence that animal brains learn probability
distributions, combine them with prior knowledge, and act as
Bayesian optimizers.

An additional parallel is that IT&E primes the robot for creativity
during a motionless period, after which the generated ideas are tested.
This process is reminiscent of the finding that some animals start the
day with new ideas that they may quickly disregard after
experimenting with them, and more generally, that sleep improves creativity on
cognitive tasks. A final parallel is that the simulator and Gaussian
process components of IT&E are two forms of predictive models,
which are known to exist in animals. Overall, we have shown that
IT&E has parallels in biology and makes robots behave more like
animals by endowing them with the ability to adapt rapidly to
unforeseen circumstances.

Figure 3 | Main experiments and results.
a, Conditions tested on the physical hexapod robot.
C1, The undamaged robot. C2, One leg is
shortened by half. C3, One leg is unpowered. C4,
One leg is missing. C5, Two legs are missing. C6,
An imperfect, makeshift repair to the tip of one leg
(by a human operator). b, Performance after
adaptation. Box plots represent IT&E. The central
circle is the median, the edges of the box are the
25th and 75th percentiles, the whiskers extend to
the most extreme data points that are not
considered outliers [...] performance of the handmade
reference tripod gait (Supplementary Methods).
Conditions C1-C6 are tested five times each for eight
independently created behaviour-performance maps with the
‘duty factor’ behaviour description (that is, 40
experiments per damage condition,
Supplementary Methods). Damage conditions C1
and C3 are also tested five times each for eight
independently created behaviour-performance
maps with the ‘body orientation’ behaviour
description (Supplementary Methods). c, Time and
number of trials required to adapt. Box plots
represent IT&E. d, Robotic arm experiment. The
eight-jointed, planar robot arm has to drop a ball
into a bin. e, Example conditions tested on the
physical robotic arm. C1, One joint is stuck at 45
degrees. C2, One joint has a permanent 45° offset.
C3, One broken and one offset joint. Each of these
damage conditions was replicated with 15
independent maps. A total of 14 conditions were
tested (Extended Data Fig. 7). f, Time and number
of trials required to reach within 5 cm of the bin
centre. Each condition is tested with 15
independent maps.

Figure 4 | An example behaviour-performance map. This map stores
high-performing behaviours at each point in a six-dimensional behaviour space.
Each dimension is the portion of time that each leg is in contact with the
ground. The behavioural space is discretized at five values for each dimension
(0; 0.25; 0.5; 0.75 and 1). Each coloured pixel represents the highest-performing
behaviour discovered during map creation at that point in the behaviour
space. The matrices visualize the six-dimensional behavioural space in two
dimensions according to the legend at the top left. The behaviour-performance
map is created with a simulated robot (bottom left) in the Open Dynamics
Engine physics simulator (http://www.ode.org). The left matrix is a
pre-adaptation map produced by the map creation algorithm. During
adaptation, the map is updated as tests are conducted (in this case, for the
damage condition where the robot is missing one leg: C4 in Fig. 3a). The
right matrix shows the state of the map after a compensatory behaviour is
discovered. The arrows and white circles represent the order in which
behaviours were tested on the physical robot. The red circle is the final,
discovered, compensatory behaviour. Among other areas, high-performing
behaviours can be found for the damaged robot in the first two columns of
the third dimension. These columns represent behaviours that use the
central-left leg least, which is the leg that is missing.

Extended Data Figure 1 | An overview of the IT&E algorithm. a, Behaviour-
performance map creation. After being initialized with random controllers, the 
behavioural map (a2), which stores the highest-performing controller found so 
far of each behaviour type, is improved by repeating the process depicted in a1
until newly generated controllers are rarely good enough to be added to the map 
(here, after 40 million evaluations). This step, which occurs in simulation (and 
required roughly two weeks on one multi-core computer; see Supplementary 
Methods, section ‘Running time’), is computationally expensive, but only needs 
to be performed once per robot (or robot design) before the robot is deployed. 
b, Adaptation. In b1, each behaviour from the behaviour—performance map has 
an expected performance based on its performance in simulation (dark green 
line) and an estimate of uncertainty regarding this predicted performance (light 
green band). The actual performance on the now-damaged robot (black dashed 
line) is unknown to the algorithm. A behaviour is selected for the damaged 
robot to try. This selection is made by balancing exploitation (trying
behaviours expected to perform well) and exploration (trying behaviours
whose performance is uncertain) (Supplementary Methods, section ‘Acquisition
function’). Because all points initially have equal, maximal uncertainty, the first 
point chosen is that with the highest expected performance. Once this 
behaviour is tested on the physical robot (b4), the performance predicted for 
that behaviour is set to its actual performance, the uncertainty regarding that 
prediction is lowered, and the predictions for, and uncertainties about, nearby 
controllers are also updated (according to a Gaussian process model, see 
Supplementary Methods, section ‘Kernel function’), the results of which can be 
seen in b2. The process is then repeated until performance on the damaged 
robot is 90% or greater of the maximum expected performance for any 
behaviour (b3). This performance threshold (orange dashed line) lowers as the 
maximum expected performance (the highest point on the dark green line) is 
lowered, which occurs when physical tests on the robot underperform 
expectations, as occurred in b2.

Extended Data Figure 2 | The contribution of each subcomponent of the
IT&E algorithm. a, Adaptation progress versus the number of robot trials.
Shown is the walking speed achieved with IT&E and several ‘knockout’
variants that are missing one of the algorithm’s key components (defined
in the table below the plots). Some variants (4 and 5) correspond to
state-of-the-art learning algorithms (policy gradient and Bayesian
optimization). MAP-Elites (written by J.-B.M. and J.C.; ref. 35) is the
algorithm that IT&E uses to create the behaviour-performance map (see
Supplementary Methods). The bold lines represent the medians and the
coloured areas extend to the 25th and 75th percentiles. b, c, Adaptation
performance after 17 and 150 trials. Shown is the speed of the
compensatory behaviour discovered by each algorithm after 17 and
150 evaluations on the simulated robot, respectively. For all panels, data are
pooled across six damage conditions (the removal of each of the six legs in turn)
for eight independently generated maps and replicated ten times (for a total
of 480 replicates). P values are computed via a Wilcoxon rank-sum test. The
central circle is the median, the edges of the box show the 25th and 75th
percentiles, the whiskers extend to the most extreme data points that are
not considered outliers, and outliers are plotted individually. See experiment 2
in the Supplementary Information for methods and analysis.

Extended Data Figure 3 | The IT&E algorithm is robust to environmental
changes. Each plot shows both the performance and required adaptation time 
for IT&E when the robot must adapt to walk on terrains of different slopes. 
a, Adaptation performance on an undamaged, simulated robot. On all slope 
angles, with very few trials, the IT&E algorithm (pink shaded region) finds 
fast gaits that outperform the reference gait (black dashed line). We performed 
ten replicates for each of the eight maps and each one-degree increment 
between −20° and +20° (a total of 3,280 experiments). b, Adaptation
performance on a damaged, simulated robot. The robot is damaged by
having each of its six legs removed in six different damage scenarios. Data
are pooled from all six of these damage conditions. For each damage condition,
we performed ten replicates for each of the eight maps and each two-degree
increment between −20° and +20° (a total of 10,080 experiments).
The median compensatory behaviour found via IT&E outperforms the median
reference controller on all slope angles. The middle, black lines represent
medians, while the coloured areas extend to the 25th and 75th percentiles. In
a, the black dashed line is the performance of a classic tripod gait for reference.
In b, the reference gait is tried in all six damage conditions and its median
(black line) and 25th and 75th percentiles (black coloured area) are shown. See
experiment 3 in the Supplementary Information for methods and analysis.

Extended Data Figure 4 | The IT&E algorithm is largely robust to
alternative choices of behaviour descriptors. a, b, The speed of the 
compensatory behaviour discovered by IT&E for various choices of behaviour 
descriptors. Performance is plotted after 17 and 150 evaluations in a and
b, respectively. Experiments were performed on a simulated, damaged
hexapod. The robot is damaged by removing each of its six legs in turn,
giving six damage scenarios. Data are pooled across all six damage conditions. As
described in experiment 5 of the Supplementary Information, the evaluated
behaviour descriptors characterize the following: (1) time each leg is in contact
with the ground (‘Duty factor’); (2) orientation of the robot frame
(‘Orientation’); (3) instantaneous velocity of the robot (‘Displacement’); (4)
energy expended by the robot in walking (‘Energy (Total)’, ‘Energy (Relative)’);
(5) deviation from a straight line (‘Deviation’); (6) ground reaction force on
each leg (‘GRF (Total)’, ‘GRF (Relative)’); (7) the angle of each leg when it
touches the ground (‘Lower-leg angle (Pitch)’, ‘Lower-leg angle (Roll)’,
‘Lower-leg angle (Yaw)’); and (8) a random selection without replacement from
subcomponents of all the available behaviour descriptors (1)-(7) (‘Random’).
We created eight independently generated maps for each of 11 intentionally
chosen behavioural descriptors (88 in total). We created one independently
generated map for each of 20 randomly chosen behavioural descriptors
(20 in total). For each of these 108 maps, we performed ten replicates of
IT&E for each of the six damage conditions. In total, there were thus
8 × 11 × 10 × 6 = 5,280 experiments for the intentionally chosen behavioural
descriptors and 1 × 20 × 10 × 6 = 1,200 experiments for the randomly
chosen behavioural descriptors. For the hand-designed reference gait (yellow) 
and the compensatory gaits found by the default duty factor behaviour 
descriptor (green), the bold lines represent the medians and the coloured areas 
extend to the 25th and 75th percentiles of the data. For the other treatments, 
including the duty factor treatment, black circles represent the median, the 
edges of the boxes show the 25th and 75th percentiles, the whiskers extend to 
the most extreme data points that are not considered outliers, and outliers 
are plotted individually. See experiment 5 in the Supplementary Information 
for methods and analysis.

Extended Data Figure 5 | How the behaviour-performance map (data
normalized) is explored to discover a compensatory behaviour. Data are
normalized each iteration to highlight the range of remaining performance
predictions. The robot is adapting to damage condition C4 (see Fig. 3a).
Colours represent the performance prediction for each point in the map relative
to the highest performing prediction in the map at that step of the process. A
black circle indicates the next behaviour to be tested on the physical robot. A red
circle indicates the behaviour that was just tested (note that the performance
predictions surrounding it have changed versus the previous panel). Arrows
reveal the order in which points have been explored. The red circle in the last
map is the final, selected, compensatory behaviour. In this scenario, the robot
loses leg number 3. The six-dimensional space is visualized according to the
inset legend.

Extended Data Figure 6 | How the behaviour-performance map (data not
normalized) is explored to discover a compensatory behaviour. The data are
not normalized in order to highlight that performance predictions decrease as it
is discovered that predictions from the simulated, undamaged robot do not
work well on the damaged robot. The robot is adapting to damage condition C4
(see Fig. 3a). Colours represent the performance prediction for each point in
the map relative to the highest performing prediction in the first map. A black
circle indicates the next behaviour to be tested on the physical robot. A red
circle indicates the behaviour that was just tested (note that the performance
predictions surrounding it have changed versus the previous panel). Arrows
reveal the order in which points have been explored. The red circle in the last
map in the sequence is the final, selected, compensatory behaviour. In this
scenario, the robot loses leg number 3. The six-dimensional space is visualized
according to the inset legend. The data visualized in this figure are identical
to those in the previous figure: the difference is simply whether the data are
renormalized for each new map in the sequence.

Extended Data Figure 7 | IT&E works on a completely different type of
robot (the robotic arm experiment). a, The robotic arm experimental setup. 
b, Tested damage conditions. c, Example of behaviour-performance maps
(colour maps) and behaviours (overlaid arm configurations) obtained with
MAP-Elites. Left: A typical behaviour-performance map produced by MAP-
Elites with five example behaviours, where a behaviour is described by the angle 
of each of the eight joints. The colour of each point is a function of its 
performance (in radians squared), which is defined as having low variance in 
the joint angles (that is, a zigzag arm has lower performance than a straighter 
arm that reaches the same point). Right: Neighbouring points in the map 
tend to have similar behaviours, thanks to the performance function, which 
would penalize more jagged ways of reaching those points. That neighbours 
have similar behaviours justifies updating predictions about the performance 
of nearby behaviours after testing a single behaviour on the real (damaged) 
robot. d, Accuracy (in metres) versus trial number for IT&E and traditional 
Bayesian optimization. The experiment was conducted on the physical robot, 
with 15 independent replications for each of the 14 damage conditions. 
Accuracy is pooled from all of these 14 × 15 = 210 experiments for each 
algorithm. The middle lines represent medians, while the coloured areas 
extend to the 25th and 75th percentiles. e, Success for each damage condition. 
Shown is the success rate for the 15 replications for each damage condition, 
defined as the percentage of replicates in which the robot reaches within 5 cm of 
the bin centre. f, Trials required to adapt. Shown is the number of iterations 
required to reach within 5 cm of the bin centre. g, Accuracy after 30 physical 
trials for each damage condition (with the stopping criterion disabled). For 
f and g, the central circle is the median, the edges of the box show the 25th 
and 75th percentiles, the whiskers extend to the most extreme data points 
that are not considered outliers, and outliers are plotted individually. See 
experiment 1 in the Supplementary Information for methods and analysis. 
Ocean impact on decadal Atlantic climate variability 
revealed by sea-level observations 


Gerard D. McCarthy, Ivan D. Haigh, Joël J.-M. Hirschi, Jeremy P. Grist & David A. Smeed 


Decadal variability is a notable feature of the Atlantic Ocean and 
the climate of the regions it influences. Prominently, this is man- 
ifested in the Atlantic Multidecadal Oscillation (AMO) in sea sur- 
face temperatures. Positive (negative) phases of the AMO coincide 
with warmer (colder) North Atlantic sea surface temperatures. The 
AMO is linked with decadal climate fluctuations, such as Indian 
and Sahel rainfall’, European summer precipitation’, Atlantic hur- 
ricanes* and variations in global temperatures*. It is widely 
believed that ocean circulation drives the phase changes of the 
AMO by controlling ocean heat content’. However, there are no 
direct observations of ocean circulation of sufficient length to sup- 
port this, leading to questions about whether the AMO is con- 
trolled from another source’. Here we provide observational 
evidence of the widely hypothesized link between ocean circulation 
and the AMO. We take a new approach, using sea level along the 
east coast of the United States to estimate ocean circulation on 
decadal timescales. We show that ocean circulation responds to 
the first mode of Atlantic atmospheric forcing, the North 
Atlantic Oscillation, through circulation changes between the sub- 
tropical and subpolar gyres—the intergyre region’. These circula- 
tion changes affect the decadal evolution of North Atlantic heat 
content and, consequently, the phases of the AMO. The Atlantic 
overturning circulation is declining*® and the AMO is moving to a 
negative phase. This may offer a brief respite from the persistent 
rise of global temperatures‘, but in the coupled system we describe, 
there are compensating effects. In this case, the negative AMO is 
associated with a continued acceleration of sea-level rise along the 
northeast coast of the United States”’®. 

The difficulty in linking ocean circulation changes to decadal cli- 
mate variations lies in the fact that long observational records of ocean 
transports are rare. Measurements such as those of the Florida Current 
since 1982"' and the Greenland-Scotland ridge transports’* since the 
mid-1990s are some of the longest continuous ocean transport records 
available. Continuous, full-depth, basin-wide measurements of the 
Atlantic overturning circulation only began in 2004 with the RAPID 
monitoring project at 26° N (ref. 13). None of these records are long 
enough to directly link ocean circulation with decadal climate varia- 
tions such as the AMO. 

Sea-level measurements from tide gauges provide an integrated 
measure of water column properties and offer timeseries of sufficient 
length (Extended Data Fig. 1) to study decadal ocean circulation var- 
iations. Investigating ocean circulation using tide gauges is not new: 
the first attempt to estimate the Gulf Stream using tide gauges was 
made in 1938". The principle is based on geostrophic dynamics: on 
timescales longer than a few days, ocean circulation is in geostrophic 
balance so, looking downstream, the sea level is seen to increase from 
left to right in the Northern Hemisphere. 

Estimates of the Gulf Stream using tide gauges have focused on the 
American east coast, with an offshore estimate of sea level from either 
an island gauge’* or a reconstructed sea level’®. A weakness of this 
method is that the offshore measurement lies in the eddy-filled ocean 


where sea-level fluctuations at any one point are influenced by 
mesoscale variations’” even on long timescales, increasing the difficulty 
of making estimates of ocean circulation that are coherent on large 
spatial scales. This is the case for sea level at Bermuda, whose decadal 
fluctuations can be reproduced by considering a Rossby wave response 
to wind forcing’®. To make estimates of ocean circulation that capture 
the fluctuations in large-scale circulation and are less affected by eddy 
variability, measurements close to or on the western boundary are 
necessary’*. We account for this by focusing on the gradient of sea level 
along the US east coast. Here the mean dynamic sea level decreases to 
the north (Fig. 1a) due to the transition from subtropical to subpolar 
gyres. This dynamic gradient reflects a circulation that contains ele- 
ments not only of the Gulf Stream but also of cold, subpolar water from 
the north, primarily associated with the overturning circulation”. 
Indeed, in model simulations, this meridional gradient of sea level along 
this coast responds strongly to declines in the Atlantic overturning 
circulation”’. Ultimately, it is the heat transport that we are interested 
in. And while the overturning circulation carries about 90% of the heat 
at subtropical latitudes’, ocean heat transport at the latitude of the 
intergyre region consists of similar contributions from both the over- 
turning circulation and the gyre circulation’. For this reason, we do not 
discuss separately overturning and gyre but only ocean circulation in 
this intergyre region, which contains elements of both mechanisms. 

Sea-level fluctuations from Florida to Boston divide into two coher- 
ent groups either side of Cape Hatteras” (Extended Data Figs 2, 3). This 
large-scale coherence in sea level is driven by ocean circulation. North 
of Cape Hatteras, coherent sea-level fluctuations have been linked with 
fluctuations in the overturning circulation’®*. South of Cape Hatteras, 
fluctuations in the Gulf Stream from Florida to Cape Hatteras are 
reflected in sea-level fluctuations. As Cape Hatteras marks the bound- 
ary between the subtropical and subpolar gyres on this coastline 
(Fig. 1a), we can construct a single sea-level composite representative 
of the subtropical (subpolar) circulation by averaging sea level from 
linearly detrended, deseasonalized tide gauges, with the inverse baro- 
meter effect removed, south (north) of the Cape (Fig. 1b, c). The dif- 
ference, south minus north (Fig. 1d), represents our circulation index. 
This index projects onto observed surface velocities during the satellite 
era in the intergyre region, with a positive index associated with more 
northwards flow and a more northerly path of this circulation 
(Extended Data Fig. 4). Similarly, in a high-resolution ocean model, 
over timescales that contain both the cool phase of the AMO in the 
1970s** and the warm phase of the 1990s”*, the sea-level index projects 
onto a similar pattern of circulation, with a positive index associated 
with more northward heat transport (Extended Data Fig. 5). 

Ocean circulation is proportional to heat transport at both subtrop- 
ical and subpolar latitudes’. A number of recent studies (see, for 
example, ref. 27) have emphasized the dominant role of ocean heat 
transport in heat content changes, relating the accumulation (in time) 
of heat transport to heat content. This suggests that the accumulation 
of our sea-level index across Cape Hatteras, as a proxy for ocean 
circulation, can be related to ocean heat content. The largest AMO 
signal is in the subpolar region (Fig. 1a), so we wish to show that, as a 
measure of ocean circulation, our sea-level index is related to heat 
transport into the subpolar gyre and consequently heat content 
changes there. Such a mechanism is supported by our model, in which 
the sea-level index leads the heat transport into the subpolar gyre 
at 40° N and, consequently, leads the heat content changes there 
(Extended Data Fig. 6). 

Figure 1 | Dynamic sea level and circulation along the western Atlantic 
seaboard. a, Negative (positive) mean dynamic topography contours in blue 
(red) indicate cyclonic (anticyclonic) geostrophic streamlines. Contour values 
in metres shown in Fig. 2. The zero contour (dark blue) marks the boundary 
between the subtropical and subpolar gyres. Hatched areas indicate warm sea 
surface temperature anomalies of greater than 0.5 °C during the positive phase 
of the AMO from 1995-2004 relative to 1961-2012. b, c, Dynamic sea-level 
anomalies north (b; sites 7-30, +200 mm offset) and south (c; sites 1-6, 
−200 mm offset) of Cape Hatteras, with averages in black. d, The difference in 
sea level, southern minus northern average, defines our sea-level index for 
ocean circulation. 

Although we do not have observations of heat transport, we can 
relate our sea-level index directly to the heat content changes in the 
subpolar gyre since 1960. Figure 2a shows the accumulated sea-level 
index (blue curve), together with a direct estimate of the heat content in 
the area, namely the depth-weighted temperature anomaly in the top 500 m 
between 40° N and 60° N (black line). Heat content trends are similar 
throughout the upper 1,000 m of the Atlantic, below which they reverse 
due to the depth structure of the Atlantic overturning circulation. The 
cool subpolar upper ocean of the 1970s and 1980s and subsequent 
warming in the 1990s is captured by the accumulated sea-level index, 
observationally supporting the hypothesis that circulation changes and 
not only air-sea fluxes were involved in these changes. For the 
purposes of statistical analyses, the timeseries have had a 7-year low-pass 
Tukey filter applied to them, which is referred to by the prefix ‘7-year’ 
from here on. The 7-year sea-level index leads the 7-year rate of 
heat content change by 2 years with a maximum correlation of 0.58 
(significant at the 95% level). The reason that the accumulated sea-level 
index leads the large rise in heat content from 40° N to 60° N in the early 
1990s can be interpreted by looking at maps of the heat content anom- 
aly evolution. Heat content builds downstream of the intergyre region 
from the mid-1980s to the mid-1990s (Fig. 2b). This heat content 
anomaly is then observed downstream in the subpolar gyre in the late 
1990s and early 2000s (Fig. 2c), indicating that the sea-level index could 
provide an early indication of subpolar heat content change. 

The first mode of atmospheric variability over the North Atlantic, 
the North Atlantic Oscillation (NAO), forces both buoyancy and 
wind-driven ocean circulation’ and, we believe, is the major forcing 
of the circulation in the intergyre region. The 7-year NAO is signifi- 
cantly correlated with (r = 0.71 at the 98% level) and leads the 7-year 
sea-level difference by approximately 1 year over the period 1950 to 
2012. On extending the time period to 1920-2012, the correlation 
drops slightly but remains significant (r = 0.61 at the 98% 
level, Extended Data Fig. 7). The correlation between the sea-level 
difference and the NAO is higher and more significant than the 
correlation of the NAO with either the southern or northern sea-level 
(Fig. 1b, c) composites (r = −0.5 at the 86% level for the southern 
composite; r = −0.43 at the 70% level for the northern); this supports 
our hypothesis that the NAO forces the ocean circulation and 
consequently the ocean heat transport into the subpolar gyre. 

Figure 2 | Relating the sea-level circulation index to heat content changes. 
a, Accumulated sea-level index (nominally, in mm month) derived from 
accumulating the sea-level circulation index (blue), temperature anomaly in the 
upper 500 m of the subpolar North Atlantic from 40° to 60° N (black) and 
accumulated NAO (red, dashed). b, Average temperature anomaly in the top 
500 m for the period 1985-94 relative to the average from 1958-2010. 
Contours of mean dynamic topography (metres) defined in Fig. 1a are overlaid 
for reference. c, Same as b but for the period 1995-2004. 

Figure 3 | Sea-level circulation index, the NAO and the AMO on 
multi-decadal timescales. Shown are the accumulated sea-level index (blue), which is 
representative of subpolar heat content evolution, the accumulated NAO (red, 
dashed) and the AMO (black). The heat content proxy and the accumulated 
NAO have been normalized. All timeseries have been 7-year low-pass filtered. 
The accumulated sea-level index and accumulated NAO have been detrended. 

In the past 90 years, the AMO has undergone three major transi- 
tions: warming in the mid-1990s and 1920s, and a cooling in the 1960s. 
From the early-1920s, when the tide gauge network along the east coast 
of North America was developed, robust comparisons of our sea-level 
index to the AMO are possible (Fig. 3). The accumulated sea-level 
index and the accumulated NAO are linearly detrended and capture 
much of the multi-decadal variation. The 7-year sea-level index leads 
the 7-year rate of change of the AMO by 2 years and is significantly 
correlated (r = 0.51, significant at the 96% level, Extended Data Fig. 8). 
This lead time of 2 years remains consistent when the timeseries is 
broken into 60-year blocks (Extended Data Table 2). In recent years, 
the sea-level index (Fig. 1d) indicates that the AMO is again transition- 
ing to a negative phase, consistent with observations of a reduced 
overturning circulation’. 

Using the sea-level difference between subtropical and subpolar 
gyres, we have developed and validated a proxy for ocean circulation 
in the intergyre region. This represents a mechanism for ocean heat 
transport to the subpolar gyre and heat content changes there. When 
observations exist, heat content changes have coincided with the major 
phase transitions of the AMO, confirming that ocean circulation plays a 
key role in decadal Atlantic variability. The ocean responds to NAO 
forcing with changes in ocean circulation: on decadal timescales, the 
ocean integrates NAO forcing and returns it to the atmosphere as the 
AMO. This is implicitly the Bjerknes compensation that had previously 
been seen in air-sea fluxes”. The sea-level difference provides an indi- 
cator of ocean circulation changes that precede phase changes in the 
AMO, thus explaining why, as the positive AMO declines’, sea-level 
rise is accelerating north of Cape Hatteras”'®. While Greenland ice sheet 
melt has been linked with accelerating sea-level rise in recent years, the 
fact that the period of accelerated sea-level rise from the 1950s to the 
1970s'° (as well as the current period of sea-level rise) coincides with a 
declining AMO indicates that multi-decadal fluctuations in ocean cir- 
culation play a key role. In this framework, sea-level rise along the US 
east coast becomes entwined with the effects of the AMO on climate. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Data. Monthly mean sea-level records were obtained from the Permanent Service 
for Mean Sea-level (www.psmsl.org) for tide gauges stretching from Florida to 
Boston (locations 1 to 30, Extended Data Fig. 1). Linear trends were removed from 
each record. This removes the impact of glacial isostatic adjustment and other land 
subsidence effects, which have time periods of thousands of years and are known 
to affect tide gauges along this coastline. A 12-month low-pass filter removed the 
seasonal cycle. Southern (northern) composites of sea level were calculated by 
averaging records 1-6 (7-30). The meridional coherence of sea-level fluctuations 
is such that using just a single tide gauge results in an r.m.s. error of only 5 mm 
relative to the full composite. Finally, the sea-level index is simply the difference 
obtained by subtracting the northern from the southern sea-level composite. The 
high level of meridional coherence allows the interpretation of the sea-level gra- 
dient as this simple index. 
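
The processing chain described above (detrend each record, low-pass over 12 months, average into southern and northern composites, subtract) can be sketched in a few lines. This is a minimal illustration in Python, not the authors' Matlab code: the function names, the synthetic records and the plain boxcar mean are assumptions, and the inverse barometer correction and gaps in real tide-gauge records are ignored.

```python
import math
from statistics import mean


def detrend(series):
    """Remove the mean and least-squares linear trend from a monthly record."""
    n = len(series)
    t_mean = (n - 1) / 2
    y_mean = mean(series)
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(series))
    slope = sxy / sxx
    return [y - y_mean - slope * (t - t_mean) for t, y in enumerate(series)]


def running_mean(series, window=12):
    """Boxcar low-pass filter; only fully covered windows are returned."""
    return [mean(series[i:i + window]) for i in range(len(series) - window + 1)]


def composite(records):
    """Average several equally long records month by month."""
    return [mean(values) for values in zip(*records)]


def sea_level_index(south_records, north_records, window=12):
    """Southern composite minus northern composite."""
    south = composite([running_mean(detrend(r), window) for r in south_records])
    north = composite([running_mean(detrend(r), window) for r in north_records])
    return [s - n for s, n in zip(south, north)]


# Hypothetical synthetic records (mm): trend + seasonal cycle + decadal signal.
MONTHS = 240


def synthetic(trend, signal_amp):
    return [trend * t
            + 100.0 * math.sin(2 * math.pi * t / 12)
            + signal_amp * math.sin(2 * math.pi * t / 120)
            for t in range(MONTHS)]


south = [synthetic(0.30, 20.0), synthetic(0.25, 20.0)]
north = [synthetic(0.10, -20.0), synthetic(0.15, -20.0)]
index = sea_level_index(south, north)
```

In this toy setup the linear trends and the seasonal cycle cancel, and only the opposing decadal signals on either side of the hypothetical boundary survive in `index`.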
Sources. Monthly NAO data from the National Center for Atmospheric Research 
“The Climate Data Guide: Hurrell North Atlantic Oscillation (NAO) Index (PC- 
based)” (https://climatedataguide.ucar.edu/climate-data/hurrell-north-atlantic- 
oscillation-nao-index-pc-based); monthly AMO index, based on the Kaplan sea 
surface temperature (SST) data set (from http://www.esrl.noaa.gov/psd/data/ 
timeseries/AMO/); subsurface temperature data from the EN3 product (http:// 
www.metoffice.gov.uk/hadobs/en3/); geostrophic velocity anomalies were pro- 
duced and distributed by Aviso (http://www.aviso.altimetry.fr/) as part of the 
Ssalto ground processing segment. CNES-CLS09 Mean Dynamic Topography 
(v1.1 release) for the period 1993-99 was produced by the French Space Agency 
CNES. 
Model validation. The multi-decadal oscillation of SSTs is most intense in the 
subpolar gyre (Fig. 1a). Modelling studies have shown that it is ocean heat trans- 
port into the subpolar gyre (here we choose 40° N) that controls the heat content of 
the subpolar upper ocean and consequently the SST. The concept here is that 
circulation in the intergyre region reflects the balance between warm subtropical 
water entering the subpolar gyre and colder subpolar water being recirculated 
within the gyre. We show that the sea-level gradient along the US east coast is a 
good proxy for this circulation (Extended Data Figs 4 and 5). 

We can relate sea-level changes to ocean circulation in a reduced gravity geo- 
strophic framework: 


$$\mathbf{v} = \frac{g'}{f}\,\mathbf{k} \times \nabla h$$


where v is geostrophic velocity, k is the unit vector in the vertical direction, h is 
sea level, g' is reduced gravity and f is the Coriolis parameter. To estimate the 
transport in the intergyre region, previous studies have considered the sea-level 
difference between an onshore tide gauge and an offshore tide gauge, such as 
Bermuda. Reference 24, for example, relates the sea-level difference between 
Atlantic City and Bermuda to the Atlantic overturning circulation. 

However, Bermuda is in the eddy-filled ocean interior’, which can disrupt 
spatially-coherent ocean transport signals. Our approach is to use sea-level esti- 
mates south of Cape Hatteras instead of an offshore sea-level estimate. Dynamic 
topography along the US east coast also decreases to the north across the intergyre 
boundary at Cape Hatteras much as it decreases from Bermuda to Atlantic City. 
However, measurements on the coast do not suffer the same contamination due to 
eddies as mid-ocean measurements'*. Hence we estimate the transport along the 
intergyre boundary as: 


$$v_{ig} \propto h_s - h_n$$


where the subscript ig refers to the intergyre region, s and n refer to south and 
north respectively. We can formulate the heat transport through a section strad- 
dling the intergyre boundary as: 


$$HT_{ig} = \rho c_p \iint \Theta\, v_{ig}\, \mathrm{d}A$$


where ρ is density, c_p is the specific heat capacity of seawater, Θ is conservative 
temperature and A is the area of the section considered. In this study we assume 
that the velocity fluctuations dominate the temperature fluctuations and so set the 
heat transport directly proportional to the intergyre velocity. This is an assump- 
tion that has proved true in direct heat transport estimates”'. We note there is no 
dilemma in picking the location of the northern or southern points as the meri- 
dional coherence of sea-level fluctuations allows us to use a simple average of all 
sea-level records from Miami Beach to Cape Hatteras (Cape Hatteras to Boston) 
for h_s (h_n). 
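
The assumption that velocity fluctuations dominate can be made explicit by splitting temperature and velocity into a time mean and a fluctuation. The decomposition below is an illustrative sketch rather than a derivation taken from the paper; the overbar and prime notation is introduced here as an assumption.

```latex
% Assumed notation: overbar = time mean, prime = departure from the time mean.
\begin{align}
\Theta &= \bar{\Theta} + \Theta', \qquad v_{ig} = \bar{v}_{ig} + v_{ig}' \\
HT_{ig} &= \rho c_p \iint \left( \bar{\Theta}\,\bar{v}_{ig}
          + \bar{\Theta}\,v_{ig}' + \Theta'\,\bar{v}_{ig}
          + \Theta'\,v_{ig}' \right) \mathrm{d}A \\
% If the terms containing \Theta' are small, the time-varying part reduces to
HT_{ig}' &\approx \rho c_p \iint \bar{\Theta}\, v_{ig}' \,\mathrm{d}A
          \;\propto\; h_s - h_n
\end{align}
```

Under this reading, the detrended sea-level difference tracks heat transport anomalies only to the extent that the terms containing Θ' remain small compared with the velocity term.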

In terms of upper ocean heat content, the heat transported in this intergyre 
region has a profound impact on the subpolar gyre. This is because warm water 
may be transferred from the upper waters of the subtropics to the subpolar gyre 
whereas subpolar water can only enter the subtropics at depth (traditionally in the 
deep western boundary current). Therefore we relate the heat transport into 
the subpolar gyre and heat content of the upper waters of the subpolar gyre 
to the transport in the intergyre region: 


$$HT_{spg} \propto h_s - h_n$$


For exactly the reason that we need to use tide gauges as a proxy for heat 
transport, we cannot validate the conceptual model directly due to the lack of 
direct observations. However, a global eddy-permitting (1/4°) ocean model 
(ORCA-025) provides the framework to investigate these balances. The heat 
transport into the subpolar gyre has previously been shown in this model to be 
the dominant factor in setting upper ocean temperature in the subpolar gyre”. 
Here, we reproduce this result, showing that the accumulated heat transport across 
40° N captures the major decadal fluctuations in heat content of the subpolar gyre 
(Extended Data Fig. 6). We can use these heat transport measurements to validate 
our circulation index. At this resolution there are shortcomings in the representa- 
tion of the Gulf Stream path: the Gulf Stream overshoots at Cape Hatteras 
and separates from the US coast too far north. However, we take account of 
this in choosing the northern and southern sea-level points so that they 
straddle the separation point. Also, despite the model being eddy-permitting 
rather than eddy-resolving, it does generate mesoscale variability. This is seen 
when including an offshore sea-level measurement (such as Bermuda) in a sea- 
level circulation index. Such an index fails to reflect the large scale circulation. This 
effect would be expected to be even larger in an eddy-resolving model. Extended 
Data Fig. 5 shows that the model-derived sea-level index projects onto the inter- 
gyre velocities in a similar manner to the observed sea-level index. Extended Data 
Fig. 6 shows the accumulated sea-level difference compared with the accumulated 
heat transport across a section near 40° N and the volume averaged temperature of 
the upper 500 m of the subpolar gyre (40° N to 60° N). The sea-level difference is 
significantly correlated with the heat transport into the subpolar gyre (r = 0.62) 
and leads by 5 years (as in the main text, we report statistics on unaccumulated 
timeseries). 

Code availability. Matlab code is available to download from http://bit.ly/ 
1F7gtps. 

Statistical analysis. Cross-correlations are calculated using annually averaged 
data after first removing the mean and linear trend from each variable. Two 
approaches are used to quantify the uncertainty in the correlation. First, we cal- 
culated the parameter 


$$T = r\,\sqrt{\frac{N-2}{1-r^{2}}}$$

where r is the correlation and N is the number of samples. The distribution of T is 
assumed to have a t-distribution with N — 2 degrees of freedom when the samples 
are not autocorrelated. This is used with a one-sided test to estimate the likelihood 
that the correlation has not occurred by chance (that is, the certainty with which 
we can reject the null hypothesis). Our data are autocorrelated and the number of 
independent samples (degrees of freedom) is therefore smaller than N. To cal- 
culate the effective number of degrees of freedom we follow ref. 30 by evaluating 
the autocorrelation of each variable and then estimate N_eff as 

$$N_{eff} = N_{obs}\,\frac{1 - a_1 a_2}{1 + a_1 a_2}$$

where N_eff is the effective number of degrees of freedom, N_obs is the number of 
observations and a_1, a_2 are the values of the autocorrelations of the two variables 
at a lag of one year. We evaluated N_eff over the longest time for each variable and 
then used the lowest value for all correlations. For the shorter time series N_eff was 
reduced in proportion to the length of the series. Degrees of freedom are reported in 
Extended Data Table 1. 
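
A minimal Python sketch of the two quantities used in this first approach, the effective number of degrees of freedom and the statistic T. The function names are hypothetical, and the lag-1 autocorrelation estimator is one common choice that may differ from the authors' implementation. The example inputs r = 0.51 and N = 13 are read from Extended Data Table 1; treating the tabulated DoF as N in the formula is an assumption.

```python
import math


def lag1_autocorrelation(x):
    """Lag-1 autocorrelation of an annually sampled series."""
    n = len(x)
    m = sum(x) / n
    dev = [v - m for v in x]
    num = sum(dev[i] * dev[i + 1] for i in range(n - 1))
    den = sum(d * d for d in dev)
    return num / den


def effective_dof(x, y):
    """N_eff = N_obs * (1 - a1*a2) / (1 + a1*a2), with a1, a2 the lag-1 autocorrelations."""
    a1 = lag1_autocorrelation(x)
    a2 = lag1_autocorrelation(y)
    return len(x) * (1 - a1 * a2) / (1 + a1 * a2)


def t_statistic(r, n):
    """T = r * sqrt((N - 2) / (1 - r^2)), to be compared with a t-distribution on N - 2 dof."""
    return r * math.sqrt((n - 2) / (1 - r * r))


# Illustrative call with values read from Extended Data Table 1 (assumed mapping).
t_value = t_statistic(0.51, 13)
```

A smooth, strongly autocorrelated series gives a much smaller `effective_dof` than its raw length, which is why the significance levels are computed with N_eff rather than the number of observations.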


In a second approach we applied the non-parametric method described in 
ref. 31. A large number (we used 10,000) of simulated time series are con- 
structed from the Fourier transform of one of the original data series by 
preserving the modulus of each Fourier component but changing the phase 
to a random value between 0 and 2π. The distribution of correlations between 
these random series and the second variable was then calculated. The percent- 
age of simulated correlations that are less than the observed correlation indi- 
cates the confidence that the true correlation is greater than zero. Because we 
are considering lagged correlations we modify the technique of ref. 31 so that 
for each simulated time series we evaluate the maximum of cross-correlation 
across all lags rather than the correlation at zero lag only. This provides a more 
stringent test of confidence. 
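
The phase-randomization test described above can be sketched as follows. This is an illustrative Python version under stated assumptions, not the authors' Matlab code: all function names and defaults are hypothetical, and a plain O(n²) Fourier transform is used only to keep the example dependency-free (an FFT library would be the natural choice in practice). In `max_lagged_correlation`, a positive lag means the first series leads the second.

```python
import cmath
import math
import random


def dft(x):
    """Discrete Fourier transform of a sequence (direct O(n^2) sum)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def surrogate_from_spectrum(spec, rng):
    """Real series with the same Fourier moduli as `spec` but random phases."""
    n = len(spec)
    new = [0j] * n
    new[0] = spec[0]                          # keep the mean unchanged
    for k in range(1, (n - 1) // 2 + 1):
        phase = rng.uniform(0.0, 2.0 * math.pi)
        new[k] = abs(spec[k]) * cmath.exp(1j * phase)
        new[n - k] = new[k].conjugate()       # Hermitian symmetry gives real output
    if n % 2 == 0:
        new[n // 2] = spec[n // 2]            # Nyquist term is real; keep it
    return [sum(new[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]


def correlation(x, y):
    """Pearson correlation of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def max_lagged_correlation(x, y, max_lag):
    """Maximum correlation over lags -max_lag..max_lag (positive lag: x leads y)."""
    best = -1.0
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = x[:len(x) - lag], y[lag:]
        else:
            a, b = x[-lag:], y[:len(y) + lag]
        best = max(best, correlation(a, b))
    return best


def surrogate_confidence(x, y, max_lag=5, n_surrogates=200, seed=0):
    """Observed maximum lagged correlation and the percentage of surrogates below it."""
    rng = random.Random(seed)
    spec = dft(x)
    observed = max_lagged_correlation(x, y, max_lag)
    below = sum(
        max_lagged_correlation(surrogate_from_spectrum(spec, rng), y, max_lag) < observed
        for _ in range(n_surrogates)
    )
    return observed, 100.0 * below / n_surrogates
```

Taking the maximum over all lags for every surrogate, rather than the zero-lag value alone, is what makes the resulting confidence level more conservative for lagged correlations.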

To estimate the uncertainty in the time lag of the maximum correlation we used 
the times at which the correlation was equal to the maximum value less the 
standard deviation of correlations derived from the simulated time series. The 
results are summarized in Extended Data Table 1. 

We have also evaluated the correlation over shorter periods to determine if 
the lag has remained constant over time. Results from three overlapping 60-year 
periods are shown in Extended Data Table 2. For each the correlation is a max- 
imum when sea-level difference leads the differentiated AMO by 2 to 3 years. 

The text refers to both accumulated and unaccumulated timeseries. Accumu- 
lation of zero mean timeseries constrains the beginning and end of the accumu- 
lated timeseries to zero. To avoid this arbitrary constraint, we report all 
our statistics on unaccumulated timeseries. As mentioned, for the purposes of 
statistical analyses, the timeseries have had a 7-year low-pass Tukey filter applied 
to them, which is referred to in the text with the prefix ‘7-year’. 
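
The constraint mentioned above is easy to see in a toy example: once the mean is removed, the running sum necessarily returns to zero at the end of the record. The short Python sketch below uses made-up numbers and a hypothetical function name.

```python
from itertools import accumulate


def accumulate_anomalies(series):
    """Running sum of a series after removing its mean."""
    m = sum(series) / len(series)
    return list(accumulate(v - m for v in series))


# Made-up values with mean 1.5; the anomalies sum to zero by construction.
toy_index = [2.0, 5.0, 1.0, -3.0, 4.0, 0.0]
accumulated = accumulate_anomalies(toy_index)
# accumulated == [0.5, 4.0, 3.5, -1.0, 1.5, 0.0]
```

Because the end point is pinned to zero regardless of the data, correlations computed on accumulated series would partly reflect this constraint, which is the reason given for reporting statistics on unaccumulated timeseries.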

Sample size. The timespan of the study was the maximum for which all of the 
necessary data were available. Therefore no statistical methods were used to pre- 
determine sample size as we used all the samples available to us. 


30. Bretherton, C. S., Widmann, M., Dymnikov, V. P., Wallace, J. M. & Bladé, I. The 
effective number of spatial degrees of freedom of a time-varying field. J. Clim. 12, 
1990-2009 (1999). 

31. Ebisuzaki, W. A method to estimate the statistical significance of a correlation when 
the data are serially correlated. J. Clim. 10, 2147-2153 (1997). 
Extended Data Figure 1 | Tide gauges used in this study. a, Locations and b, temporal coverage of the tide gauges used in this study. 
Extended Data Figure 2 | Dynamic sea-level anomalies from the 30 stations 
used in this study. Linear trends were removed from each record. This 
removes the impact of glacial isostatic adjustment and other land subsidence 
effects, which have time periods of thousands of years and are known to affect 
tide gauges along this coastline. A seasonal cycle was removed using a 
12-month boxcar filter. From 1920, there are multiple tide gauges both north 
and south of Cape Hatteras, so this is when we begin our study. 
Extended Data Figure 3 | Correlation of tide gauges along the US east coast relative to one another. The dashed line indicates the location of Cape Hatteras. 
There is high correlation between tide gauges grouped north and south of Cape Hatteras. 
Extended Data Figure 4 | Surface velocity anomaly when the sea-level index 
is positive. a, Magnitude (m s⁻¹) and b, zonally integrated meridional velocity 
anomalies (10° m* s_') for the time period 1993 to 2011, corresponding to 
when (c) the sea-level index is positive. A positive sea-level index is associated 
with a more northerly circulation in the intergyre region and increased surface 
flow into the subpolar gyre. Velocities are geostrophic surface velocities derived 
from satellite altimetry. 
Extended Data Figure 5 | Model-derived surface velocity anomaly 
magnitude when the model-based sea-level index is positive. Similar to 
observed velocities, positive indices are associated with more northerly 
circulation in the intergyre region. a, Surface velocity magnitude (m s⁻¹) and 
b, percentage of meridional heat transport change (%) for the time period 1958 
to 2001, corresponding to when (c) the model-derived sea-level index is 
positive. Similar to the satellite observations, a positive sea-level index is 
associated with a more northerly circulation in the intergyre region. Meridional 
heat transport change in both subtropical and subpolar gyres is positive when 
the sea-level index is positive. 
Extended Data Figure 6 | Model-derived sea-level index, heat transport 
and subpolar heat content. The accumulated sea-level index (Acc. SL diff, 
blue, in mm months) leads the accumulated heat transport into the subpolar 
gyre across 40° N (Acc. HT on; black, normalized units). The heat transport 
into the subpolar gyre dominates the top 500 m temperature anomaly 
(Subpolar HCA, green, °C) in the subpolar gyre. 
Extended Data Figure 8 | Relationship between sea-level index and the rate 
of change of the AMO. a, 7-year sea-level difference (blue) and rate of change 
of the AMO (green). b, Lagged correlations between the two quantities. 
c, Scrambled correlation tests. The histogram indicates the typical correlations 
that would be expected from randomly generated timeseries with similar 
spectral properties to the original timeseries. The red line indicates the 
maximum correlation between the two timeseries. 
Extended Data Table 1 | Sea level, NAO and rates of change of the AMO statistics 

Var X | Var Y | Filt (yrs) | Time interval | DoF | Corr | Sig. % (t-stat) | Sig. % (Scrm) | RMS of rand corr | Lag at max corr (yrs) | Estimated range of lag (yrs) 
Di HC 
1950-2012 | 9 98 
1950-2012 | 9 86 84 
1920-2012 | 13 98 99 0 
B-A | Di AMO | 7 | 1920-2012 | 13 | 0.51 | 96 | 98 | 0.18 | -2 | -4 to -1 
NAO | Di AMO | 7 | 1920-2012 | 13 | 0.58 | 98 | 98 | 0.21 | -4 | -5 to -2 

Correlation, lags and significance between the southern (B) and northern (A) sea-level indices. First variable (Var X) and second variable (Var Y) are indicated in the first two columns. In the second column, Di refers 
to the rate of change, and HC refers to subpolar heat content from 40° to 60° N. All timeseries are filtered with a 7-year low-pass filter (Filt). Degrees of Freedom (DoF), correlation (Corr) and significance (Sig.) are 
shown. Significance is determined by a t-statistic (t-stat) or a scrambled test (Scrm). The RMS of a random correlation (column 9) together with the lag at maximum correlation (column 10) allows determination of 
a range for the lag estimate (column 11). 
Extended Data Table 2 | Sub-sampled statistics


From To Correlation Lag (yrs) Lag range (yrs) 
1920 2012 0.5 -2 -4 to -1 


1952 2012 0.54 -2 -4 to 0 


Correlation, lags and lag range of sea-level index and the rate of change of the AMO over various time periods to investigate the consistency of the lags. Time periods under consideration are indicated in the first two 
columns. 
Diverse coupling of neurons to populations in sensory cortex

Michael Okun, Nicholas A. Steinmetz, Lee Cossell, M. Florencia Iacaruso, Ho Ko, Péter Barthó, Tirin Moore,
Sonja B. Hofer, Thomas D. Mrsic-Flogel, Matteo Carandini & Kenneth D. Harris


A large population of neurons can, in principle, produce an astro- 
nomical number of distinct firing patterns. In cortex, however, 
these patterns lie in a space of lower dimension’, as if individual 
neurons were “obedient members of a huge orchestra”. Here we use 
recordings from the visual cortex of mouse (Mus musculus) and 
monkey (Macaca mulatta) to investigate the relationship between 
individual neurons and the population, and to establish the under- 
lying circuit mechanisms. We show that neighbouring neurons can 
differ in their coupling to the overall firing of the population, rang- 
ing from strongly coupled ‘choristers’ to weakly coupled ‘soloists’. 
Population coupling is largely independent of sensory preferences, 
and it is a fixed cellular attribute, invariant to stimulus conditions. 
Neurons with high population coupling are more strongly affected 
by non-sensory behavioural variables such as motor intention. Pop- 
ulation coupling reflects a causal relationship, predicting the res- 
ponse of a neuron to optogenetically driven increases in local activity. 
Moreover, population coupling indicates synaptic connectivity; the 
population coupling of a neuron, measured in vivo, predicted sub- 
sequent in vitro estimates of the number of synapses received from 
its neighbours. Finally, population coupling provides a compact summary
of population activity; knowledge of the population couplings
of n neurons predicts a substantial portion of their n² pairwise
correlations. Population coupling therefore represents a novel, simple
measure that characterizes the relationship of each neuron to a 
larger population, explaining seemingly complex network firing pat- 
terns in terms of basic circuit variables. 

The cortex represents its computations through the joint activity of 
multiple neurons. This activity can be remarkably diverse even among 
neighbouring neurons, belonging to the same morphological and lam- 
inar cell class. In sensory cortex, neighbouring neurons not only respond 
to diverse stimulus features**, but also use diverse strategies to encode 
information. For example, mean firing rate differs by orders of magni- 
tude across neurons” "’, and it appears to constitute an invariant prop- 
erty of each cell, persisting across multiple stimulus conditions and 
spontaneous activity’*. We asked whether there are other invariant 
dimensions that characterize the diversity of firing of cortical neurons. 
Ideally, such dimensions would not only help explain the complex pat- 
terns of activity produced by cortical populations, but also relate directly 
to underlying circuit variables. 

To characterize how different cortical neurons relate to large-scale 
firing patterns, we considered their relationship to the ‘population rate’, 
that is, the summed activity of all neurons in the recorded population at
any moment in time’*’*. We recorded first from populations of 20-80 
neurons in deep layers of area V1 in awake head-fixed mice using multi- 
site silicon probes (Fig. 1a). Consistent with previous reports’”-’, popu- 
lation rate was not strictly controlled by sensory stimulation, but showed 


coherent fluctuations even spontaneously in the absence of sensory 
stimuli (Fig. 1b, c), closely tracking the simultaneously recorded local 
field potential (LFP, Fig. 1d). 

During spontaneous activity, neighbouring neurons differed widely 
in their coupling to the population. The spike-triggered population rate 
typically had a single peak centred close to zero (Fig. 1e and Extended
Data Fig. 1), whose height was large for some neurons and small or even 
reversed for others. The variation in population coupling across neu- 
rons was continuous, and neurons did not appear to fall into discrete 
classes of distinct coupling. However, this diversity in coupling repre- 
sented a robust property of individual cells; computing coupling from 
separate data segments yielded highly consistent results (Extended Data 
Fig. 2a). A similar result was seen when correlating spikes with the LFP 
(Fig. 1f and Extended Data Fig. 2b). These variations in population 
coupling were not trivially explained by differences in mean firing rate; 
shuffling the spike times across neurons and time, while preserving each 
neuron’s mean rate and the population rate distribution”, destroyed the 
diversity in population coupling between neurons (Fig. 1g and Extended 
Data Fig. 2c). Moreover, diversity in population coupling was not restricted 
to mouse V1; it was also observed in monkey area V4 (Fig. 1h) and in 
rat (Rattus norvegicus) auditory cortex (Extended Data Fig. 3). 
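
The shuffling control described above can be illustrated with a minimal sketch. It assumes a binary raster (units by time bins) and uses repeated 2 x 2 "checkerboard" swaps, one common way to randomize such a raster while keeping each unit's spike count and the population count in every bin fixed. The function name and parameters are hypothetical, and this is not necessarily the exact procedure used for Fig. 1g.

```python
import random

def shuffle_preserving_marginals(raster, n_swaps, seed=0):
    """Shuffle a binary raster (list of rows, one per unit) with 2x2 swaps.

    A swap picks two units and two time bins whose four entries form
    [[1, 0], [0, 1]] or [[0, 1], [1, 0]] and flips the pattern. Every row sum
    (spikes per unit) and column sum (population count per bin) is unchanged.
    Illustrative assumption, not the paper's stated algorithm.
    """
    rng = random.Random(seed)
    shuffled = [row[:] for row in raster]          # leave the input untouched
    n_units, n_bins = len(shuffled), len(shuffled[0])
    done, attempts = 0, 0
    max_attempts = 1000 * n_swaps                  # guard against rasters with no valid swap
    while done < n_swaps and attempts < max_attempts:
        attempts += 1
        i, j = rng.sample(range(n_units), 2)
        s, t = rng.sample(range(n_bins), 2)
        a, b = shuffled[i][s], shuffled[i][t]
        c, d = shuffled[j][s], shuffled[j][t]
        if a == d and b == c and a != b:           # checkerboard pattern found
            shuffled[i][s], shuffled[i][t] = b, a
            shuffled[j][s], shuffled[j][t] = d, c
            done += 1
    return shuffled
```

Because the swaps keep both marginals fixed, any difference in population coupling that survives such a shuffle would be attributable to firing rate or population rate alone; in the data it did not survive.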

Population coupling differed both between and within cell classes. 
On the basis of spike shape”, we classified ~ 13% of the cells in mouse 
V1 as narrow-spiking putative interneurons (Extended Data Fig. 4a), 
with the remainder primarily pyramidal cells. Narrow-spiking cells tended 
to have higher population coupling than wide-spiking cells (medians of 
0.84 and 0.50; P< 10 °). Nevertheless, this difference between classes 
was substantially smaller than the variability within each class (Extended 
Data Fig. 4). Strongly coupled neurons were more likely to exhibit burst- 
ing and to have a lower firing rate. However, even cells with similar firing 
rate and bursting dynamics could show great variation in coupling (Ex- 
tended Data Fig. 4b-d). 

We next asked to what degree population coupling can explain the 
patterns of spontaneous correlated activity in a population. To this end, 
we generated random population activity patterns, constrained so that 
each neuron’s population coupling and mean rate matched the original 
data, as did the distribution of population rate over time (Fig. 2a). We 
found that pairwise correlations in the resulting synthetic activity resem- 
bled those observed in the recorded activity (Fig. 2b). The accuracy of 
this prediction was higher when the population rate showed stronger 
variance, indicating that the single statistic of coupling was able to cap- 
ture the correlations induced by globally coordinated fluctuations (Fig. 2c). 
If coupling was not included, however, the performance of the model 
was substantially impaired (Fig. 2c, d). 

These results indicate that much of the pairwise correlation in the 
population is explained by the coupling of each neuron to population 
rate, with coupling strengths that vary between neurons. The underlying
model is parsimonious, requiring only order n parameters to predict
order n² pairwise correlations. Moreover, the model is intuitive,
involving procedures among the simplest in neuroscience: summing
the activity of multiple neurons, and correlating the spike train of each
neuron with the result. To assess whether more advanced procedures
would yield different results, we used a variant of latent variable analysis
designed for discrete spike count data to obtain the weights of
individual neurons to the first detected factor. Reassuringly, these weights
were highly correlated with population coupling (Extended Data Fig. 5a);
latent variable analysis found the same basic structure as our simple
coupling model.

The ability of the model to predict correlations may appear surprising
given that it operates without knowledge of the neurons’ sensory tuning.
In primary sensory areas, neurons with similar sensory selectivity show
stronger stimulus-independent correlations, and we observed a
significant relationship between orientation preference similarity and
spontaneous correlations (Pearson correlation ρ = 0.26, P< 10 °, for
n = 163 pairs of cells with orientation selectivity index >0.6). The
coupling model could not predict this relationship; indeed, its predictions
of correlations were independent of orientation preference similarity
(P = 0.15, Fig. 2e). However, population coupling represented a
much better predictor of pairwise correlations than orientation preference
similarity (ρ = 0.63, P< 10°); moreover, the two predictors
are independent, representing complementary predictions that sum to
explain the fine structure of pairwise correlations within a population
(Fig. 2).

Figure 1 | Neighbouring neurons differ markedly in population coupling
during spontaneous activity. a, Schematic of a single shank of silicon
electrode array, and spike waveforms of four example wide-spiking neighbour
neurons measured with the array in deep layers of V1 of an awake mouse.
b, Population raster of spontaneous activity in 66 neurons recorded from the
whole array. Cells are arranged vertically in order of population coupling.
Arrows indicate the four example neurons shown in a. c, Population rate
measured by summing all the spikes detected on the entire array. d, LFP
measured on a shank adjacent to that on which the example neurons were
recorded (LFP waveforms were similar across shanks). e, Spike-triggered
population rate (stPR) for the four example neurons. The spike train of each
neuron was excluded from the population rate before computing its stPR.
f, The spike-triggered local field potential (stLFP) for the four example cells
(inverted for ease of comparison) resembles their stPR (shown in e). Inset,
normalized magnitudes of stPR and stLFP (see Methods) are highly correlated
across cells (ρ = -0.71, P< 10° rank correlation, n = 431 neurons).
g, Differences in population coupling disappear after shuffling spikes in a
manner that preserves each neuron’s mean firing rate and the population rate.
Inset, population couplings in the actual spike trains (red) and after shuffling
(grey), for neurons from all experiments. h, stPR of four example neurons
simultaneously recorded in primate area V4, computed as in e.

Figure 2 | A simple model based on population coupling predicts the
structure of pairwise correlations in a cortical population. a, The model
generates random spike patterns subject to three constraints: that the
population coupling of each neuron, the mean firing rate of each neuron, and
the distribution of the population rate must match those in the original data.
b, Random activity generated by the model produces pairwise correlations that
are similar to those measured in the original spike trains (n = 67 units in
one experiment; correlations computed in 20-ms bins). The upper triangle
shows observed pairwise correlations, and the lower one shows pairwise
correlations predicted by the model. Neurons are arranged in order of
population coupling. The values on the diagonal (all 1s) have been removed.
Similarity of observed and predicted correlations is indicated by the symmetry
of the upper and lower triangles. c, Percentage of explainable correlation
structure predicted, as a function of the variability of population rate (filled
symbols, see Methods). The model captures pairwise correlations, but only in
experiments in which the population rate fluctuates. It cannot predict them
when population rate is mostly constant (a highly desynchronized cortical
state). Recordings were obtained from mouse V1 in wakefulness (diamonds)
or under anaesthesia (circles), or from A1 of awake rat (squares), all
spontaneous activity; note that a variety of states is observed in all conditions.
Open symbols show predictions of a model that ignores population coupling.
The example experiment in b is shown in red. d, Same as b for predictions
made without using population coupling. Such predictions fail to capture the
structure of pairwise correlations (open markers in c). e, The model cannot
predict a relationship between similarity of preferred orientation and
spontaneous pairwise correlations (P = 0.15, Pearson correlation). f, As a
result, this correlation is retained in the residual pairwise correlations obtained
by subtracting the modelled from actual correlations (ρ = 0.26, P< 10°,
Pearson correlation), indicating that the predictions of coupling and
orientation sum linearly. The black line in f shows regression on cos(2Δθ).

Population coupling was invariant to visual stimulation, and showed 
little relation to basic visual properties. We measured population coup- 
ling across neurons during responses to natural movies, and found that 
it closely resembled that obtained during spontaneous activity (Fig. 3a, b 
and Extended Data Fig. 6a, b). Differences in coupling, therefore, reflect 
invariant properties of each neuron, rather than differences in the 
responses of each neuron to a specific set of stimuli. Both strongly and 
weakly coupled cells could respond to natural movies, showing reliable 
modulation by visual stimuli (Fig. 3c, d). In responses to gratings, more- 
over, population coupling showed no relationship to orientation select- 
ivity, spatial frequency preference, or linearity of spatial summation 
(Extended Data Fig. 6c-e). 

However, population coupling predicted the overall increase in mean 
firing rate caused by a visual stimulus. Strongly coupled cells exhibited 
a larger increase in mean firing rate during visual stimulation, both in 
responses to natural stimuli (Fig. 3c—-e; p = 0.38, P< 10 n= 431) 
and in responses to drifting gratings (p = 0.32, P = 2 X 10° °,n=217; 
Extended Data Fig. 6f). Thus population coupling—a quantity defined 
on a time scale of tens of milliseconds, and measurable during spon- 
taneous activity—predicts the temporally extended, nonspecific eleva- 
tion in firing rate seen during sensory stimulation. 

These correlates of population coupling do not prove that the relation- 
ship between population rate and individual cells’ firing is causal. To 
test causality, we used transgenic mice that express channelrhodopsin-2 
in a subset of layer 5 pyramidal neurons”. Light stimulation (0.5-s 
pulses at ~2 mW per mm² using a blue LED) strongly increased network
activity (Fig. 3f). A control experiment (brief laser pulses at 100 mW
per mm²) indicated that in most cells this increase was driven synaptically.
Only a few cells (~5%) reliably responded at short enough latencies
(<5 ms) to indicate direct optogenetic depolarization. While strongly
coupled cells substantially increased their firing in response to LED 
stimulation, activity in weakly coupled cells showed little increase, or 
even decreased (Fig. 3g). Therefore, the diverse coupling of individual 
neurons to population rate reflects differences in the causal influence 
of the population on these cells. 

These observations suggest that more strongly coupled neurons might 
receive stronger synaptic input from neighbouring neurons. Consistent 
with this view, intracellular in vivo measurements indicated that popu- 
lation coupling measured from a neuron’s spike train resembles the 
coupling measured from the neuron’s subthreshold membrane poten- 
tial, which mostly reflects synaptic inputs (Extended Data Fig. 7; see 
also ref. 17). 

To investigate whether population coupling is related to synaptic 
connectivity, we studied cortical populations that were first imaged in 
vivo and then recorded in vitro**”’. Population activity in superficial 
cortical layers was measured during presentation of natural movies and 
images in lightly anaesthetized mice. As in deeper layers, these cells 
showed a wide heterogeneity in population coupling (Fig. 4a, b). Cou- 
pling was not related to a cell’s spatial location (Fig. 4b and Extended 
Data Fig. 8a), and we observed no correlation between in vivo coupling 
and intrinsic properties such as resting potential, input resistance and 
spike threshold (Extended Data Fig. 8b-d). Population coupling was 
again consistent between the halves of each recording, even when these 
halves contained different sets of stimuli (p = 0.85, P< 107°, n = 4,215 
neurons from 15 experiments). Next, we analysed synaptic connectivity 
Figure 3 | Population coupling under natural and optogenetic stimulation
conditions. a, Spike-triggered population rate (stPR) for the four example 
neurons in Fig. 1a-g during responses to a natural movie. The curves are similar
to those measured during spontaneous activity (Fig. 1e). b, Comparison of
population coupling during spontaneous and evoked activity, across cells and 
experiments (p = 0.88, P< 107! rank correlation). c, Spike rasters (one 
row per presentation of a natural movie) and corresponding firing rate for a 
strongly coupled neuron (red neuron in a). Dashed line indicates baseline firing 
rate. Shaded area shows the duration of the movie clip. d, Same as c for a
weakly coupled neuron (purple neuron in a). e, The increase in mean firing rate 
of a cell in response to natural movie presentations (relative to baseline) 
correlates with population coupling measured during spontaneous activity 

(p = 0.38, P< 10}, rank correlation, n = 431 neurons from 13 recordings in 
8 animals). Black diamonds, running median. Points outside the x-axis 

range appear at the border for display purposes. f, Population rasters showing 
activity of deep-layer V1 neurons during four example trials (out of 75 in total), 
where the network was optogenetically driven by blue light in a mouse 
expressing channelrhodopsin-2 sparsely in layer 5. Neurons are sorted by their 
population coupling during spontaneous activity. Shaded area shows the 
duration of optogenetic stimulation. g, Change in mean firing rate evoked by 
optogenetic stimulation correlates with population coupling measured during 
spontaneous activity (ρ = 0.51, P< 10°! rank correlation, n = 237
neurons). Points outside the x-axis range appear at the border for display 
purposes. Black diamonds, running median. 


using paired in vitro whole-cell recordings of the same neurons (Fig. 4c). 
We focused on pyramidal cells, identified by somatodendritic morpho- 
logy when filled with Alexa dye, and by a regular-spiking firing pattern. 

These paired recordings revealed a significant correlation between a 
neuron’s population coupling and its probability of receiving synaptic 
input from its neighbours (Fig. 4d). According to a logistic regression 
fit, a pyramidal cell whose population coupling was one standard devi- 
ation below the average had a probability of 0.14 to receive a connection 
from a neighbouring neuron, whereas for a pyramidal neuron whose 
coupling was a standard deviation above the mean this probability
increased to 0.23 (a 65% increase). In contrast, there was no correlation
between a neuron’s population coupling and the probability that
it provided synaptic outputs to neighbouring cells (Fig. 4e). These results
are consistent with a correlation between population coupling and input
connection probability as high as ρ = 0.65 (Extended Data Fig. 9).

These data suggest that diversity in mean input connectivity may
form the circuit mechanism for diversity in population coupling. The
diversity in mean input connectivity that we have observed is separate
from the tendency of cortical pyramidal cells to receive more synaptic
connections from cells with similar sensory tuning. Thus, if the network
is stimulated non-specifically (for example by optogenetic activation
of a random neuronal subset, Fig. 3f, g), the effects of this nonspecific
stimulation, amplified by recurrent cortical connections, will be largest
for neurons with strong mean input connectivity (Extended Data
Fig. 10a, b). To make these intuitive ideas more precise, we extended
an established modelling framework, and mathematically analysed a
model cortical circuit in which different excitatory neurons have different
mean input probability (see Supplementary Information). This
model reproduced many of our experimental findings, including the
relationship between population coupling and mean input connectivity
(Extended Data Fig. 10c and Fig. 4d), the structure of spontaneous
pairwise correlations (Extended Data Fig. 10d and Fig. 2b), and the additivity
of correlations predicted by sensory tuning similarity and by population
coupling strength (Extended Data Fig. 10e-h and Fig. 2e, f).

Figure 4 | Neurons with strong population coupling receive more synaptic
inputs from their neighbours. a, Neurons in superficial layers of mouse
V1 were bulk-loaded with Oregon Green BAPTA-1 dye and their activity
recorded using two-photon imaging during presentation of natural movies
and images. Population coupling was assessed as the correlation between each
cell’s calcium signal with the summed signal of all other neurons. The two
coloured traces show a segment of activity from a strongly and a weakly coupled
neuron (orange and blue, respectively), each superimposed on the averaged
population activity (grey). Scale bar, 20% ΔF/F for each single neuron, 5% for
population average. b, Left, structural scan of an imaged volume measuring
~260 × 260 × 56 µm. Right, pseudocolour representation of population
coupling for each of 147 neurons in the volume. c, Synaptic connectivity of
a subset of the imaged neurons was later assessed using simultaneous in vitro
whole-cell recordings. Top, four example pyramidal cells (solid white circles),
recorded in vitro together with an additional fast-spiking interneuron
(dashed circle) that was excluded from later analysis. Bottom, four synaptic
connections were found between these four pyramidal neurons, shown here
coloured by their in vivo population coupling. The two weakly coupled neurons
(blue) received zero or one input, while the strongly coupled neuron (yellow)
received two inputs. d, Logistic regression estimate of probability to receive
a synaptic connection as a function of the population coupling of the target
(dashed black lines, 95% confidence intervals; error bars, mean ± standard
error for binned data). e, As in d for outgoing connections.


If neurons with stronger population coupling are more likely to be 
driven by nonspecific excitation, they may show greater modulation by 
non-sensory factors, which are believed to be conveyed to sensory cortex 
by diffuse, nonspecific inputs’*’’. To test this hypothesis, we analysed 
population recordings made from area V4 of monkeys performing a 
cued-saccade task’’. Consistent with the hypothesis, strongly coupled 
neurons increased their firing rate during saccade preparation into their 
receptive field (Fig. 5a). Weakly coupled neurons often showed the 
opposite effect (Fig. 5b). Overall, there was a marked correlation between 
a neuron’s population coupling and its modulation by saccade pre- 
paration (p = 0.37, P= 10”, Fig. 5c). 

In conclusion, population coupling constitutes a previously unap- 
preciated dimension characterizing the relationship of individual neu- 
rons to population activity. Strongly coupled neurons (choristers) are 
more strongly activated during multiple conditions that nonspecifically 
increase the activity of their local network: not only sensory stimuli, but 
also spontaneous fluctuations, polysynaptic optogenetic stimulation, 
and top-down modulation. Conversely, weakly coupled neurons (solo- 
ists) are more immune to these population-wide events. Population 
coupling differs both across and within cell classes; it remains to be 
determined whether this within-class diversity reflects further subdivi- 
sions of these classes (such as pyramidal cells with different long-range 
axonal targets**”*), or continuous within-class variation in cellular 
parameters such as input connection probability. Moreover, a neuron’s 
population coupling need not be fixed for life; neurons that rats learn to 
Figure 5 | Population coupling under top-down stimulation conditions.

a, Mean firing rate of an example neuron in primate V4 with strong population 
coupling (red neuron in Fig. 1h). This neuron showed higher firing rate 
(solid curve) while saccades were prepared into its receptive field (RF) than 
outside of it (dashed curve). b, An example neuron with weak population 
coupling (purple neuron in Fig. 1h) showed suppressed firing during saccade 
preparation into its receptive field. c, The change in firing rate of V4 neurons 
during saccade preparation into their receptive field (relative to saccade out 
of their receptive field) correlates with population coupling (p = 0.37, 

P=10 ”°, rank correlation, n = 262 neurons). Black diamonds, running 
median. 


use while controlling a brain—machine interface show increased cor- 
relation with LFP*, indicating increased population coupling, and 
thus suggesting an increase in mean synaptic input strength. This 
single, simple variable relating each neuron’s participation in the popu- 
lation code to underlying circuit connectivity may prove critical to 
understanding cortical computation and plasticity. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Electrophysiological recordings. Recordings in mouse primary visual cortex were 
performed under licenses from the UK Home Office in accordance with the Animal 
(Scientific Procedures) Act 1986. In the electrophysiological experiments, record- 
ings were made from male and female mice older than 6-7 weeks, of C57BL/6J or 
Thy1-ChR2 line 18 strains”. Randomization was not required as all animals were 
treated similarly. 

For recordings from awake mice, animals were first implanted with a custom- 
built head plate and recording chamber under isoflurane anaesthesia. After 3 days 
of recovery accompanied by Rimadyl treatment, the mice underwent two or three 
head-restraint acclimatization sessions. On the recording day, the animals were 
briefly anaesthetized with isoflurane and a craniectomy of ~1.5 × 1.5 mm was
made above the left primary visual cortex. The dura was resected with a 30G needle 
and the brain was covered with Ringer solution and Kwik-Cast (WPI Inc.). At least 
1.5h of recovery between this procedure and the beginning of the recording were 
allowed, a period that was more than sufficient for the animals to fully resume their 
normal behaviour in their home cage. In some animals, an additional recording was 
performed the following day; in this case the brain was again protected with Ringer 
solution and Kwik-Cast in between the two recording sessions. Thirteen recordings 
were made in eight animals, of which five recordings were in three Thy1-ChR2
line 18 mice, and the rest in C57BL/6J mice. Recordings were made with Buzsaki32
or A4x8 silicon probes (NeuroNexus technologies, Ann Arbor, Michigan), lowered
to a depth of 550-1050 µm (median 700 µm) by a PatchStar manipulator
(Scientifica, Uckfield, UK). Signals were amplified and stored for offline analysis using 
the Cerebus data acquisition system (Blackrock Microsystems, Salt Lake City, Utah). 
For electrophysiological recordings in anaesthetized animals (Fig. 2 and Extended 
Data Fig. 7), anaesthesia was induced with a mixture of fentanyl citrate, fluanisone 
and midazolam (0.8 mg kg⁻¹, 25 mg kg⁻¹ and 12.5 mg kg⁻¹, respectively). A total
of nine recordings from anaesthetized animals were analysed. In a few of these 
experiments, whole-cell recordings were performed in parallel with the silicon 
probe population recording. The whole-cell intracellular recordings were per- 
formed following the standard techniques for blind patching’”*', with the patch 
pipette positioned ~200-400 µm away from the plane of the silicon probe shanks.

For the electrophysiological data, the experimental setup was similar to the one 
described in ref. 33, except that mice were standing in a custom-built tube, instead 
of on a floating Styrofoam ball. In brief, visual stimuli were presented on two of the
three available LCD monitors, positioned ~25 cm from the animal and covering a
field of view of ~120° × 60°, extending in front and to the right of the animal. Visual
stimuli analysed in the present study consist of multiple presentations of natural
movie video clips (taken from The Life of Mammals, BBC, London, UK) and drifting
gratings (100% and 25% contrast, 2 Hz, 12 directions, spatial frequencies of
0.02, 0.04, 0.08 and 0.25 cycles per degree). For recordings of spontaneous activity,
the monitors showed a uniform grey background. 

Recordings in auditory cortex (Extended Data Fig. 3) were made in awake, head- 
fixed male rats, in accordance with protocols approved by the Rutgers University 
Animal Care and Use Committee. Experimental details were previously described 
in refs 3, 18, 34. In the recordings analysed here, silicon probes with four shanks, 
and four contacts in a tetrode configuration on each shank were used (NeuroNexus 
technologies, Ann Arbor, Michigan). 

Recordings in primate area V4 were made in two male monkeys (Macaca mulatta) 
using 16-channel U-Probes (Plexon Inc., Dallas, Texas), in accordance with NIH 
Guide for the Care and Use of Laboratory Animals, the Society for Neuroscience 
Guidelines and Policies, and Stanford University Animal Care and Use Committee; 
full experimental details were previously described in ref. 27. In brief, as part of the 
behavioural task, a cue presented near the central fixation point indicated the 
direction of a saccade that would be necessary at the end of the trial to obtain a 
reward in the event of a change in the orientation of a peripheral stimulus. 
Optogenetic stimulation. For in vivo electrophysiological recordings with opto- 
genetic stimulation, we used Thy1-ChR2 line 18 mice, a transgenic line in which 
ChR2 is expressed in a sparse subset of cortical L5 pyramidal cells”. In these mice, 
in addition to recording spontaneous and visually evoked activity, we also recorded 
activity in response to optogenetic stimulation consisting of 500 ms light pulses of 
~2 mW per mm² intensity, delivered by a 470 nm (blue) LED (M470F1, Thor Labs,
Newton, New Jersey), and focused on a circular area of ~1 mm² centred on the
silicon probe insertion site.

To estimate the proportion of recorded neurons directly activated by ChR2, we
performed a control experiment, in which 20- and 40-Hz trains of five 2-ms pulses
of ~100 mW per mm² intensity were delivered to the silicon probe insertion site
using a MBL-III 473 nm 150 mW blue light laser (CNI, Changchun, China). Reliable,
short latency (<5 ms) and low jitter (<1 ms) responses were found in ~5% of the 
units, indicating that ~95% of cells were driven polysynaptically rather than directly. 
Data analysis (electrophysiology). Spikes were detected and visually verified using 
the programs NDmanager and Neuroscope. Spike sorting involved an automated 
stage, performed using KlustaKwik, and a manual verification stage for which 
Klusters or KlustaViewa were used. Detailed analysis of coupling to population 
rate and LFP was performed only for units with isolation distance >20 (see refs 38, 39). 
Units were selected and sorted blind to measures of population coupling and all 
other cellular parameters. 

Population rate (for example, in Fig. 1c) was computed by accumulating all the 
detected spikes (both well-isolated units and multi-unit activity) with 1 ms resolu- 
tion, and smoothing the resulting vector with a Gaussian of half-width 12 ms. The 
population rate used to compute the stPR for any individual unit did not include 
the spikes of that unit. The baseline level of each stPR (which reflects the mean 
population rate) was subtracted. 

For stLFP computations, raw extracellular signals were first digitally band-pass 
filtered offline between 0.1 and 200 Hz to isolate the LFP. For units recorded on a 
particular shank, the LFP was taken from an adjacent shank (200 µm away), to avoid 
contaminating the stLFP by the spike waveform itself. The size of stLFP was taken 
as the ordinate value of the negative peak of the cross-correlation in a 1 s interval 
around 0 lag. stLFPs were normalized similarly to stPRs (see below). 

The size of the stPR was quantified as the value of the spike-triggered population 
rate at 0 time lag. Thus, the population coupling of unit i is given by: 


$$c_i = \frac{1}{\|f_i\|} \sum_{j \neq i} \int f_i(t)\,\bigl(f_j(t) - \mu_j\bigr)\,dt$$

Here, f represents the smoothed firing rate of a neuron (Gaussian kernel of half 
width 12/√2 ms), μ is its mean firing rate, and ||f|| represents its norm (that is, the 
number of spikes fired). To compare the sizes of stPRs across recordings, they were 
normalized by the median size of the stPR of the shuffled data in each recording 
(see next paragraph). 
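
As a rough illustration of the population coupling measure defined above, the following Python sketch computes it from binned spike counts, replacing the integral by a sum over 1-ms bins. The function names, the input matrix `spikes`, the truncation of the kernel at four standard deviations and the reading of 'half width' as the Gaussian standard deviation are assumptions of this sketch, not details of the analysis code used in the study. The shuffle-based normalization is omitted.

```python
import numpy as np

def gaussian_kernel(sigma_bins):
    """Unit-area Gaussian kernel, truncated at +/- 4 standard deviations."""
    half = int(np.ceil(4 * sigma_bins))
    t = np.arange(-half, half + 1)
    k = np.exp(-0.5 * (t / sigma_bins) ** 2)
    return k / k.sum()

def population_coupling(spikes, sigma_bins=12 / np.sqrt(2)):
    """Population coupling of each unit.

    spikes: (n_units, n_bins) array of spike counts in 1-ms bins.
    Returns c with c[i] = (1/N_i) * sum_{j != i} sum_t f_i(t) * (f_j(t) - mu_j),
    where f is the smoothed spike train and N_i the spike count of unit i.
    """
    spikes = np.asarray(spikes, dtype=float)
    kernel = gaussian_kernel(sigma_bins)
    f = np.array([np.convolve(row, kernel, mode="same") for row in spikes])
    centred = f - f.mean(axis=1, keepdims=True)   # f_j(t) - mu_j
    total = centred.sum(axis=0)                   # summed over all units
    n_spikes = spikes.sum(axis=1)                 # ||f_i||, number of spikes
    c = np.full(len(spikes), np.nan)
    for i in range(len(spikes)):
        if n_spikes[i] > 0:
            others = total - centred[i]           # exclude unit i itself
            c[i] = np.dot(f[i], others) / n_spikes[i]
    return c
```

Units that fired no spikes are returned as NaN, which is a choice made here for convenience.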

Spike shuffling was performed according to the previously described raster 
marginals model. In more detail, the recording was first divided into non-over- 
lapping 1 ms bins. A binary matrix was then constructed with one column for each 
time bin and one row for each isolated unit as well as additional rows for the mul- 
tiunit spiking on each shank. Each matrix element contained a 1 if the correspond- 
ing unit spiked in the corresponding time bin. To shuffle, random 2-by-2 submatrices 
were repeatedly chosen with each row and column of the submatrix containing a 
0 and 1; the positions of 0s and 1s were then exchanged in the submatrix, which 
leaves the summed values of each row and column identical. As we have discussed 
previously, such a shuffling procedure produces in the limit a uniform sample 
from a distribution subject to the constraints on the mean firing rate and popula- 
tion rate distribution of the original data. 
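
The swap procedure described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name, the fixed number of attempted swaps `n_swaps` and the uniform choice of candidate rows and columns are hypothetical choices made here, and the sketch does not reproduce the exact sampling scheme of the cited raster marginals implementation.

```python
import numpy as np

def shuffle_raster(raster, n_swaps, seed=0):
    """Shuffle a binary (units x time bins) raster by 2-by-2 checkerboard swaps.

    Every swap leaves all row sums (spike counts per unit) and all column
    sums (population rate per bin) unchanged.
    """
    rng = np.random.default_rng(seed)
    m = np.array(raster, dtype=np.int8)
    n_rows, n_cols = m.shape
    for _ in range(n_swaps):
        r1, r2 = rng.choice(n_rows, size=2, replace=False)
        c1, c2 = rng.choice(n_cols, size=2, replace=False)
        # Swappable submatrices are [[1, 0], [0, 1]] or [[0, 1], [1, 0]].
        diagonal_equal = m[r1, c1] == m[r2, c2]
        antidiagonal_equal = m[r1, c2] == m[r2, c1]
        if diagonal_equal and antidiagonal_equal and m[r1, c1] != m[r1, c2]:
            m[r1, c1], m[r1, c2] = m[r1, c2], m[r1, c1]
            m[r2, c1], m[r2, c2] = m[r2, c2], m[r2, c1]
    return m
```

Most randomly drawn submatrices are not swappable in a sparse raster, so in practice the number of attempts has to be much larger than the number of spikes; how many attempts suffice for good mixing is not addressed by this sketch.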

To characterize the responses of individual cells to drifting grating stimuli, the 
response for each orientation was averaged across trials, contrasts and spatial fre- 
quencies, and the orientation with the highest value was taken as preferred. The 
spatial frequency that evoked the highest response along the preferred orientation 
was taken as the preferred spatial frequency. The orientation selectivity index (OSI) 
was computed as $(R_{\mathrm{pref}} - R_{\mathrm{ortho}})/(R_{\mathrm{pref}} + R_{\mathrm{ortho}})$, where $R_{\mathrm{pref}}$ and $R_{\mathrm{ortho}}$ are the 
responses in the preferred and orthogonal orientations. $f_1/f_0$ was taken as the ratio 
between the power of the average response around 2 Hz (which was the temporal 
frequency of the drifting grating stimuli) and the mean increase in the firing rate 
above the spontaneous level. 
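
A small Python sketch of the orientation selectivity index follows. It assumes that responses have already been averaged across trials, contrasts and spatial frequencies, and that orientations are sampled in [0°, 180°); the function name and the use of the nearest sampled orientation as the orthogonal one are assumptions made for illustration.

```python
import numpy as np

def preferred_orientation_and_osi(responses, orientations_deg):
    """Return (preferred orientation, OSI) from mean responses per orientation."""
    responses = np.asarray(responses, dtype=float)
    orientations_deg = np.asarray(orientations_deg, dtype=float)
    i_pref = int(np.argmax(responses))
    ortho = (orientations_deg[i_pref] + 90.0) % 180.0
    # Circular distance (period 180 degrees) to the orthogonal orientation.
    dist = np.abs(orientations_deg - ortho)
    dist = np.minimum(dist, 180.0 - dist)
    i_ortho = int(np.argmin(dist))
    r_pref, r_ortho = responses[i_pref], responses[i_ortho]
    osi = (r_pref - r_ortho) / (r_pref + r_ortho)
    return orientations_deg[i_pref], osi
```

For example, responses of 10, 4, 2 and 4 spikes per second at 0°, 45°, 90° and 135° give a preferred orientation of 0° and an OSI of (10 − 2)/(10 + 2) ≈ 0.67.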

In primate area V4, we measured the firing rate changes during saccade prepa- 
ration for isolated single neurons. Spikes were counted during the interval between 
0.5 s after cue onset and the end of the post-cue period (that is, the start of the blank 
period), and were converted to firing rates for each trial based on the duration of 
that period. We compared firing rates from trials for which the cue indicated a 
saccade into the receptive field would be required and trials for which the cue indi- 
cated a saccade to an orthogonal location outside the receptive field would be required. 
Only spikes from correctly performed trials were considered for this analysis. Peak 
stPR size was measured identically to rodent electrophysiological recordings, and 
was computed from the continuous recording of the entire experimental session. 
Two-photon imaging. All procedures were performed under licenses from the UK 
Home Office in accordance with the Animals (Scientific Procedures) Act 1986. In 
the imaging experiments C57BL/6J mice of P22-26 age and both sexes were used. 
Anaesthesia was induced with fentanyl, midazolam and medetomidine (0.05 mg kg⁻¹, 
5.0 mg kg⁻¹ and 0.5 mg kg⁻¹, respectively) and later maintained by isoflurane 
(0.3-0.5%) in a 60:40 mixture of oxygen and nitrous oxide. The experimental setup 
for two-photon imaging was described in refs 24, 25. In brief, visual stimuli were 
presented on an LCD monitor ~20 cm from the animal, covering a field of view of 
~105° × 85°. The calcium-sensitive dye Oregon Green BAPTA-1 (OGB-1) was 
bulk loaded into the superficial layers of the cortex together with sulforhodamine 
101 to distinguish glia from neurons. 

We used two data sets in the analysis, recorded from 18 and 15 animals, respec- 
tively. In the first data set (previously described in ref. 24), visual stimuli included 
full-field and localized drifting gratings and natural movies. The gratings were 
shown at 100% contrast, 2 Hz temporal frequency, spatial frequency of 0.035 cycles 
per degree, and 8 directions. Cortical areas of ~285 × 285 µm were imaged at 7.6 Hz 
and 256 × 256 pixel resolution, and the imaging was performed for ~8-16 different 
depths, 7 µm apart. 

In the second data set, static natural images and natural movies (either video 
clips from The Life of Mammals or cage scenes from a head-mounted mouse camera) 
were presented. A volume of ~200 × 200 × 50 µm was imaged simultaneously at 
512 × 512 × 4 pixel resolution using a piezoelectric objective mover. In each exper- 
iment 2-3 such volumes were imaged. 

After in vivo imaging, whole-cell in vitro recordings of the imaged neurons were 
performed as previously described. In brief, at the end of the in vivo imaging 
session the brain was removed, 300-µm-thick coronal slices were cut and the slice 
containing the in vivo imaged region was located by the presence of OGB-1. A simul- 
taneous recording from several nearby neurons was performed using a standard 
whole-cell in vitro protocol. Synaptic connections between neurons were tested 
using 30-Hz trains of five suprathreshold 5-ms current pulses, and checking for 
postsynaptic responses in the other cells. The current injection train was repeated 
at least 30 times with 15-s intervals. Pyramidal cells and interneurons were dis- 
tinguished by spike width, resting membrane potential, responses to 1-s depolar- 
izing currents, and somatodendritic morphology when filled with Alexa dye. For 
the analysis of the relationship between coupling strength and probability to receive 
(provide) a synaptic connection (Fig. 4), we excluded pairs where the potential 
target (source) cell was an interneuron or a neuron for which no in vivo calcium 
trace was available. Simultaneously patched neurons could have been imaged non- 
simultaneously, which happened when the cells resided in different imaging planes 
or volumes. In total, 379 neurons were recorded; of 854 potential input connec- 
tions tested, 146 synaptic connections were found. Of 854 potential output connec- 
tions tested, 155 synaptic connections were found. The numbers of potential input 
and output connections were equal by coincidence; some of the input connections 
were not analysed as output connections because one of the cells was either an 
interneuron or had no in vivo data, and vice versa. 
Data analysis (imaging). Outlines of recorded neurons were detected semi- 
automatically using custom software written in MATLAB (MathWorks, Natick, 
Massachusetts). All pixels within each outline were averaged to give for each 
neuron a single ΔF/F signal, which was additionally high-pass filtered above 0.02 Hz 
to correct for slow artefacts such as photobleaching. 

The population coupling of each cell i was estimated by a similar formula to that 
used in spike train data: 


$$c_i = \frac{1}{\|f_i\|} \sum_{j \neq i} \int f_i(t)\,\bigl(f_j(t) - \mu_j\bigr)\,dt$$

where f now represents the continuous fluorescence trace of a cell (ΔF/F), and ||f|| 
its standard deviation. Note that $c_i$ is proportional to the Pearson correlation of 
cell i with the summed activity of all other neurons. To pool data across multiple 
experiments, the normalization factor was computed not by spike shuffling (which 
is not possible for calcium traces) but by z-scoring the population coupling values 
within each recording. 

For grating responses, the orientation selectivity index (OSI; Extended Data 
Fig. 6g) was estimated as $(R_{\mathrm{best}} - R_{\mathrm{ortho}})/(R_{\mathrm{best}} + R_{\mathrm{ortho}})$, where $R_{\mathrm{best}}$ and $R_{\mathrm{ortho}}$ are 
the interpolated responses to drifting gratings in the best direction and the direc- 
tions orthogonal to it. 

Statistics. Unless explicitly stated otherwise, comparisons between pairs of sets of 
values were made using Wilcoxon rank sum test and correlations were measured 
using Spearman’s rank correlation coefficients (an important exception being the 
pairwise spike train correlations, for which, as commonly accepted, we used Pearson 
correlation). No statistical methods were used to predetermine sample size. 
Pairwise correlation prediction. To predict pairwise correlations from coupling 
values, we used an extension of the raster marginals model that now also included 
a population coupling parameter for each neuron. The spike trains of all neurons 
are first converted into a binary 0-1 matrix with 20-ms bins. The parameters extracted 
correspond to: (i), the total number of 1s in row i, for all i between 1 and the number 
of rows n; (ii), the total number of columns whose sum is i, for all i between 0 and n; 
and (iii) the inner product of each row with the sum of all rows (representing the 
stPR for the unit corresponding to the row). 
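
The three sets of parameters listed above can be read off a binary raster in a few lines. The following Python sketch is illustrative only; the function name and the assumption that the raster has already been binned into a 0-1 matrix (units × 20-ms bins) are choices made here.

```python
import numpy as np

def coupling_model_parameters(raster):
    """Extract the three parameter sets from a binary (units x bins) raster.

    Returns:
      row_totals    - (i)   number of 1s in each row
      pop_rate_hist - (ii)  number of columns whose sum equals k, for k = 0..n
      coupling      - (iii) inner product of each row with the sum of all rows
    """
    m = np.asarray(raster, dtype=int)
    n = m.shape[0]
    row_totals = m.sum(axis=1)
    col_sums = m.sum(axis=0)                      # population rate per bin
    pop_rate_hist = np.bincount(col_sums, minlength=n + 1)
    coupling = m @ col_sums
    return row_totals, pop_rate_hist, coupling
```

Note that the inner product in (iii) includes each row's own spikes, as in the text; this differs from the stPR used elsewhere in the Methods, where a unit's own spikes are excluded from the population rate.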

Pairwise correlations were predicted by constructing a random matrix that respects 
the constraints represented by the above parameters, and computing the correlations 
between all pairs of rows in this random matrix. To construct such a random 
matrix we first construct a random matrix that respects constraints (i) and (ii), 
as described in ref. 14, and then perform a sequential operation to impose 
satisfaction of constraint (iii). This sequential operation consists of repeatedly 
exchanging 2 × 2 sub-matrices between rows whose population coupling is too high and 
too low. Specifically, observe that if there exists a row $n_i$ for which $\langle n_i, \sum_j n_j \rangle$ 
(the inner product with the sum of all rows) is too high, then there must also 
exist another row for which this inner product is too low; this is because 

$$\sum_i \Bigl\langle n_i, \sum_j n_j \Bigr\rangle = \Bigl\langle \sum_i n_i, \sum_j n_j \Bigr\rangle,$$

which depends only on the population rate distribution and is thus invariant to 
sub-matrix exchange. 

The satisfaction of constraint (iii) occurs by repeatedly finding a pair of rows 
that violate the constraint in opposite directions, while such pairs exist. For each 
such pair of rows, we find a pair of columns that forms a 2-by-2 sub-matrix such 
that each row and column of it contains a 0 and 1, and switch the 0s and 1s in a way 
that preserves constraints (i) and (ii) and reduces the violation of constraint (iii) in 
this pair of rows. This procedure is repeated until (iii) is satisfied (up to a small 
error of at most n). 

To measure how well this method could predict pairwise correlations, we used 
cross-validation. Time bins were divided at random into equally sized training 
and test sets. Model parameters were fit from the training set, and the prediction 
error of the model was quantified as 

$$SS_{\mathrm{modelErr}} = \sum_{i<j} (c_{ij} - \hat{c}_{ij})^2,$$

where $c_{ij}$ is the correlation between neurons i and j measured in the test set, and 
$\hat{c}_{ij}$ the correlation predicted by the model with parameters estimated on the 
training set. This error was compared to a ‘ground truth’ prediction, 

$$SS_{\mathrm{dataErr}} = \sum_{i<j} (c_{ij} - \tilde{c}_{ij})^2,$$

where $\tilde{c}_{ij}$ is the correlation between neurons i and j as directly measured in the 
training set. The quality of the model fit was defined to be the fraction of 
explainable variance: 

$$\frac{[SS_{\mathrm{tot}} - SS_{\mathrm{modelErr}}]_{+}}{SS_{\mathrm{tot}} - SS_{\mathrm{dataErr}}},$$

where $SS_{\mathrm{tot}} = \sum_{i<j} (c_{ij} - \bar{c})^2$ and $\bar{c}$ is the mean pairwise correlation. 
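
The cross-validated score described above might be computed as in the following Python sketch. The function names are hypothetical, the mean pairwise correlation is taken over the test set, and the numerator is clipped at zero; both of the latter are assumptions of this sketch about how the formula is to be read.

```python
import numpy as np

def split_time_bins(n_bins, seed=0):
    """Randomly split bin indices into equally sized training and test sets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_bins)
    half = n_bins // 2
    return np.sort(order[:half]), np.sort(order[half:])

def pairwise_correlations(raster):
    """Pearson correlations for all pairs i < j of rows, as a flat vector."""
    c = np.corrcoef(raster)
    return c[np.triu_indices(c.shape[0], k=1)]

def explainable_variance_fraction(c_test, c_model, c_train):
    """Fraction of explainable variance in test-set correlations.

    c_test  - correlations measured in the test set
    c_model - correlations predicted by a model fitted on the training set
    c_train - correlations measured directly in the training set
    """
    c_test = np.asarray(c_test, dtype=float)
    c_model = np.asarray(c_model, dtype=float)
    c_train = np.asarray(c_train, dtype=float)
    ss_model_err = np.sum((c_test - c_model) ** 2)
    ss_data_err = np.sum((c_test - c_train) ** 2)
    ss_tot = np.sum((c_test - c_test.mean()) ** 2)
    # Numerator clipped at zero (assumed reading of the formula).
    return max(ss_tot - ss_model_err, 0.0) / (ss_tot - ss_data_err)
```

A model whose predictions match the test-set correlations as well as the training-set correlations do scores 1; a model no better than predicting the mean correlation for every pair scores 0.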

Latent variable analysis of the population recordings was performed using 
the publicly available toolbox of L. Buesing and J. Macke 
(http://bitbucket.org/mackelab/pop_spike_dyn; see also ref. 21). The toolbox uses a 
nuclear norm minimization method to fit a low-dimensional model of the form $s_t \sim \mathrm{Poisson}(e^{y_t})$, where 
$y_t = C x_t + d$. The latent variables $x_t$ were modelled using a multivariate Gaussian 
distribution. The performance of this model was assessed by its ability to predict 
pairwise correlations, quantified with cross-validation measures described above. 
To generate correlations we fitted the parameters of the model and then used these 
parameters to sample spike count vectors. 
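
To make the sampling step concrete, the following Python sketch draws spike count vectors from a model of this form once a loading matrix C and offset d are given. It does not use or reproduce the toolbox cited above: the function name is hypothetical, the parameters are supplied by the caller rather than fitted, and the latent variables are assumed here to be independent standard normal draws at each time bin.

```python
import numpy as np

def sample_spike_counts(C, d, n_samples, seed=0):
    """Sample spike counts s_t ~ Poisson(exp(C x_t + d)).

    C: (n_units, n_latent) loading matrix
    d: (n_units,) offset (log baseline rate per bin)
    Returns an (n_samples, n_units) array of integer counts.
    """
    rng = np.random.default_rng(seed)
    C = np.asarray(C, dtype=float)
    d = np.asarray(d, dtype=float)
    n_latent = C.shape[1]
    x = rng.standard_normal((n_samples, n_latent))  # assumed x_t ~ N(0, I)
    y = x @ C.T + d
    return rng.poisson(np.exp(y))
```

Pairwise correlations of the sampled counts can then be computed with `np.corrcoef(counts.T)` and compared with those measured in held-out data.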

Extended Data Figure 1 | Pearson correlation between spike trains of 
individual units and the population rate. To estimate the relation of a neuron 
to the population, an alternative to spike-triggered population rate (stPR) 
would have been to compute the Pearson correlation coefficient of the neuron’s 
spike train with the summed population rate of all other recorded cells 
(a measure we term ‘Pearson coupling’). This measure, however, is biased by 
firing rate. a, Pearson coupling and stPR were computed for a set of individual 
units in an example experiment. Pearson coupling is related to the stPR, but 
not identical to it. b, The numerical value of the Pearson coupling depends 
strongly on the bin size used, but the correlations measured with different bin 
sizes are tightly related. c, Pearson correlation is biased by firing rate. 
The spike train of a single cell was ‘thinned’ to different firing rates by keeping 
only a random subset of its spikes; Pearson correlation with the population was 
recalculated for different values of firing rate. A strong effect of firing rate is 
seen. d, Performing the same analysis for population coupling (measured by 
stPR) demonstrates that this measure does not suffer from rate bias. For this 
reason, we chose to quantify population coupling with stPR in this work. 

Extended Data Figure 2 | Neighbouring neurons differ markedly in 
population coupling during spontaneous activity. a, Dividing the data 
into two halves shows that population coupling, measured as the height of 
stPR at 0 time lag, is highly consistent over time (n = 431 neurons from 
13 experiments; ρ = 0.76, P< 1072 rank correlation). Coloured dots 
represent the four example cells. b, As in a for peak spike-triggered local field 
potential (stLFP) (ρ = 0.58, P< 10 1°). c, Differences in stLFP disappear after 
shuffling spikes in a manner that preserves each neuron’s mean firing rate 
and the population rate (compare with Figure 1g). Inset, stLFPs in the actual 
spike trains (red) and after shuffling (grey), for neurons from all experiments 
(compare with Figure 1g). d, stPR size of V4 neurons is consistent over 
time (n = 262 neurons from 43 experiments; ρ = 0.95, P< 1071, rank 
correlation). 

Extended Data Figure 3 | Neighbouring neurons in auditory cortex differ 
markedly in population coupling. a, Spike-triggered population rate (stPR) 
for four example neurons recorded on the same electrode shank, during 
spontaneous activity in rat primary auditory cortex. b, Differences in 
population coupling disappear after shuffling spikes in a manner that preserves 
each neuron’s mean firing rate and the population rate distribution. c, d, As in 
a, b for the spike-triggered local field potential (stLFP). e, Dividing the data 
into two halves shows that population coupling, measured as the height of 
stPR at 0 time lag, is highly consistent over time (n = 76 neurons from 
3 experiments; ρ = 0.92, P< 10°!) rank correlation). Coloured dots 
represent the four example cells. f, As in e for stLFP (ρ = 0.81, P<10 1, 
rank correlation). 

Extended Data Figure 4 | Firing rate, burstiness and population coupling. 
a, Similarly to other studies, our recordings allow separation of narrow 
spiking (putative Pvalb⁺ inhibitory) and wide spiking (primarily excitatory 
pyramidal) neurons. Here, we used a trough-to-peak time of 0.66 ms as the 
separation criterion. b, There is a negative correlation between burstiness 
(the ratio between the peak and baseline of a neuron’s autocorrelogram) and 
mean firing rate, which is also the case individually for wide spiking (n = 384, 
ρ = −0.60, P< 107°, rank correlation) and narrow spiking (n = 47, 
ρ = −0.82, P< 10° °, rank correlation) neurons. a.u., arbitrary units. c, There is 
a positive correlation between burstiness and population coupling, which is also 
the case individually for wide spiking (ρ = 0.46, P< 10°, rank correlation) 
and narrow spiking (ρ = 0.50, P = 4 × 10⁻⁴, rank correlation) neurons. 
d, There is a negative correlation between firing rate and population coupling, 
which is also the case individually for wide spiking (ρ = −0.27, P= 10 ’, rank 
correlation) and narrow spiking (ρ = −0.37, P = 0.01, rank correlation) 
neurons. The correlation between population coupling and firing rate can be 
predicted from the correlations between burstiness and firing rate and between 
population coupling and burstiness; the partial rank sum correlation 
between population coupling and firing rate, once burstiness is taken into 
account, is insignificant (ρ = 0.06, P = 0.25). This is also the case for wide 
spiking (ρ = 0.01, P = 0.78) and narrow spiking (ρ = 0.07, P = 0.65) 
neurons individually. 

Extended Data Figure 5 | Latent variable analysis. a, Population coupling of 
each neuron is highly correlated with its loading in a single-factor latent 
variable model (see Methods). The similarity of each cell’s population coupling 
and loading indicates that the low-dimensional structure found by the latent 
variable model is homologous to that found by the coupling model. b, Percent 
of pairwise correlation structure explained by a latent variable model with 
1-5 factors (black), and by the coupling model introduced in the present study 
(dashed purple line). Error bars show standard error. While the coupling model 
outperforms latent variable models with less than four degrees of freedom, 
this difference may arise primarily from the assumption of a Gaussian 
distribution for the latent variables. Indeed, if the population rate distribution 
generated by the latent variable model is substituted into the coupling 
model instead of the (correct) population rate distribution, extremely poor 
performance results (dashed grey line). 

Extended Data Figure 6 | Population coupling and visual stimulation in 
mouse V1. a, stLFPs computed for the four example neurons of Fig. 1a–f, from 
intervals of natural movie presentation (inverted for ease of comparison, see 
Figure 1f). b, Comparison of stLFP size during spontaneous and evoked activity 
across all experiments (ρ = 0.72, P< 107! rank correlation). c–e, Population 
coupling is plotted versus the $f_1/f_0$ ratio, preferred spatial frequency and 
orientation selectivity index (OSI) for neurons recorded in the infragranular 
layers of V1. All correlations are statistically insignificant. f, Similar to movie 
presentations (Figure 3e), the mean change in the activity of a cell in response 
to grating presentations (relative to baseline, averaged across contrasts and 
orientations) correlates with population coupling measured during 
spontaneous activity (ρ = 0.32, P = 2 × 10 °, n = 217, rank correlation). Black 
diamonds, running median. g, In the two-photon imaging data (of ~10,000 
cells) only a very weak correlation between OSI and population coupling was 
found (ρ = 0.066, P< 10 °, rank correlation). 

Extended Data Figure 7 | stLFP reflects the correlation between membrane 
potential (V_m) and LFP. a, Example of a silicon probe population recording 
performed simultaneously with a whole-cell recording (in an anaesthetized 
animal). Four neurons shown in colour were recorded on the same shank of the 
silicon probe. b, Comparison of stLFP and V_m-LFP cross-correlation 
(VmLFPcc, appropriately scaled along the ordinate axis) for the intracellularly 
recorded cell. c, stLFP for the four neurons from a and the intracellularly 
recorded neuron, exhibiting diversity in the strength of coupling to LFP. 

Extended Data Figure 8 | Population coupling in two-photon data is not 
correlated with location and intrinsic properties of the neurons. a, For each 
neuron in the central region of the imaging field (defined as a square quarter 
of the total imaging area), we compared its coupling to the population of all 
other neurons in the central region, with its coupling to population of all 
neurons outside of the central region. The two were highly similar; this was the 
case because the two population rate signals were themselves highly correlated 
(on average across experiments the Pearson correlation was 0.77). Thus, 
differences in population coupling measured between cells do not reflect 
differences in the fraction of nearby neurons imaged. b-d, No significant 
correlation was observed between population coupling (measured in vivo) and 
resting potential, input resistance and spike threshold (subsequently measured 
in vitro). 

Extended Data Figure 9 | Correlation between input connectivity and 
population coupling. a, Cumulative distribution of population coupling of a 
target pyramidal neuron when an input connection was present (red) and 
when it was absent (blue). The medians (arrows) are significantly different 
(P = 0.008, rank sum test, n = 854 pairs). b, As in a for population coupling of 
the source pyramidal cells. The distributions shown were used for the logistic 
regression analysis in Fig. 4. c-f, To estimate what strength of correlation 
between input connectivity and population coupling would give rise to these 
observations, we constructed random directed graphs of 1,000 nodes (each 
node representing a L2/3 pyramidal cell) with the probability of connection 
from node j to node i given by $\sqrt{p^{i}_{\mathrm{in}}\, p^{j}_{\mathrm{out}}}$, where the propensities to receive and 
provide connections ($p_{\mathrm{in}}$ and $p_{\mathrm{out}}$, respectively) were randomly and 
independently chosen for each node from a Gaussian distribution. The 
resulting distribution of the number of input connections in a typical network is 
shown in c; the number of output connections was (by construction) similarly 
distributed. In addition, each node was assigned a population coupling 
value, highly correlated to the number of its input connections (on average 
ρ = 0.65); this correlation in a typical network is shown in d. e, f, We next asked 
how the relationship between measured connectivity and population 
coupling would look if we sample from 33 such randomly generated networks 
(equal to the number of animals used in our experimental data), the same 
amount of data empirically available in our in vitro recordings (that is, the 
connections between 2-3 randomly selected groups of 2-6 nodes). Results very 
similar to those of Fig. 4 were typically obtained (e, f; compare with a and 
Fig. 4d; error bars in f indicate standard error for binned data). In particular, 
when the entire procedure was repeated 1,000 times, in over 30% of the 
cases the P value of the difference between the medians (presented in a, e) 
was higher (that is, less statistically significant) than the value of 0.008 
obtained in the actual data. Thus, the results shown in Fig. 4 and in a are 
consistent with a strong correlation between connection probability and 
population coupling. 

Extended Data Figure 10 | Mathematical model for the relationship 
between nonspecific connectivity, specific connectivity, and correlations. 
a, A recurrent network where excitatory cells (triangles) send synaptic 
connections (arrows) to each other and to inhibitory cells (circles). Weakly 
coupled neurons (bottom) receive only connections from neurons with similar 
sensory preference (for example, for stimulus orientation, indicated in blue 
versus red). Strongly coupled neurons (top) also receive nonspecific 
connections from neurons of different sensory preference. b, The effect of 
nonspecific drive, such as caused by non-sensory top-down inputs, or 
occurring due to artificial optogenetic stimulation, is amplified through 
recurrent connections, leading to stronger activation of neurons with greater 
mean local input (darker shading). c, d, Correlations predicted by the model 
(analytically derived in Supplementary Information). c, Population coupling 
versus nonspecific connectivity $c_i$, for all simulated excitatory neurons. 
d, Pseudocolour plot of predicted pairwise correlations for a random subset of 
excitatory neurons, ordered by population coupling. e-h, Dependence of 
correlations on specific and nonspecific connectivity. e, Predicted correlations 
based on nonspecific connections versus total observed correlations. 
f, Predicted correlation based on nonspecific connectivity versus difference in 
preferred orientation. As in the experimental data (Fig. 2e), no relation is 
observed. g, Observed correlation versus difference in preferred orientation. 
As has been widely reported, observed correlations are largest for neurons of 
similar orientation preference. h, Residual correlation (after removing 
prediction from nonspecific connectivity) versus difference in preferred 
orientation. Again as in our experimental data (Fig. 2f), the residual correlation 
is largest for neurons of similar orientation preference, indicating an additive 
relationship between correlations generated by specific connections and 
correlations generated by nonspecific connections. 

Counteraction of antibiotic production and 
degradation stabilizes microbial communities 

Eric D. Kelsic, Jeffrey Zhao, Kalin Vetsigian & Roy Kishony 


A major challenge in theoretical ecology is understanding how 
natural microbial communities support species diversity, and 
in particular how antibiotic-producing, -sensitive and -resistant 
species coexist. While cyclic ‘rock-paper-scissors’ interactions 
can stabilize communities in spatial environments, coexistence 
in unstructured environments remains unexplained. Here, 
using simulations and analytical models, we show that the oppos- 
ing actions of antibiotic production and degradation enable coex- 
istence even in well-mixed environments. Coexistence depends on 
three-way interactions in which an antibiotic-degrading species 
attenuates the inhibitory interactions between two other species. 
These interactions enable coexistence that is robust to substantial 
differences in inherent species growth rates and to invasion by 
‘cheating’ species that cease to produce or degrade antibiotics. At 
least two antibiotics are required for stability, with greater num- 
bers of antibiotics enabling more complex communities and 
diverse dynamic behaviours ranging from stable fixed points to 
limit cycles and chaos. Together, these results show how multi- 
species antibiotic interactions can generate ecological stability in 
both spatially structured and mixed microbial communities, sug- 
gesting strategies for engineering synthetic ecosystems and high- 
lighting the importance of toxin production and degradation for 
microbial biodiversity. 

Antibiotic-producing species are common in natural microbial 
communities. Ecological models of antibiotics typically assume 
pairwise species relationships, where antibiotic producers inhibit sens- 
itive species more than resistant species. These pairwise inhibitory 
interactions, combined with costs for production and resistance, can 
lead to relationships of cyclic dominance among species (for example, 
rock-paper-scissors games), which can support coexistence in spatial 
environments beyond the limit set by competitive exclusion. 
However, such pairwise interaction models lead to coexistence 
through the separation of species into distinct spatial domains, 
whereas in nature antibiotic-producing, -resistant and -sensitive spe- 
cies appear to intermix even at very small length scales. 
Furthermore, species communities stabilized through pairwise anti- 
biotic interactions are not resilient to the high level of species dispersal 
expected in nature. Understanding how multiple antibiotic-pro- 
ducing species coexist despite dispersal remains an open question. 

The inhibitory interaction between an antibiotic-producing species 
and an antibiotic-sensitive species can be attenuated by the presence of 
a third ‘modulator’ species (Fig. 1a). One established mechanism for 
antibiotic attenuation is enzymatic degradation, a common mech- 
anism for antibiotic resistance. In principle, a modulator species 
could also intensify inhibitory interactions between two species, 
for example by inducing antibiotic production. However, when testing 
for such interactions among a collection of soil isolates using a three- 
species interaction assay (Fig. 1b), we observed that intensification was 
rare, while attenuation was common (Extended Data Figs 1, 2 and 
Extended Data Table 1). Realizing that such three-way attenuating 
interactions commonly occur among natural species motivated us to 
explore their impact on ecological dynamics. We focus on antibiotic 
attenuation caused by degradation, which we observed experimentally 
(Extended Data Figs 1d and 2c), but our analysis can be generalized to 
other attenuation mechanisms such as antibiotic suppression. It is 
known that antibiotic-degrading species can coexist together with 
sensitive species when an antibiotic is provided externally. 
However, when antibiotics are produced by the species themselves, 
these two-species communities are no longer stabilized by degrada- 
tion. The impact of antibiotic-degrading species on the stability of 
larger ecosystems has not been explored. 

We investigated how antibiotic-degrading species affect the
dynamics of microbial communities containing antibiotic producers
by modifying a classical spatial model of antibiotic-mediated
interactions. As in previous spatial models of antibiotic inhibition, we
consider antibiotic-producing, -sensitive or -resistant species
phenotypes. However, unlike in previous models, in addition to intrinsic
resistance we also consider resistance through antibiotic degradation
(Fig. 1a, b). Antibiotic-degrading species remove antibiotics from
nearby locations, thereby protecting not only themselves but also
neighbouring species. The simulations are performed on a grid
(Fig. 1c), with each simulation step consisting of: production of
antibiotics around antibiotic-producing species (within area of size $K_P$),
removal of antibiotics near antibiotic-degrading species (within area of
size $K_D$), killing of antibiotic-sensitive species within the antibiotic
zones, and finally colonization of empty regions on a new grid by
randomly choosing surviving species within a given dispersal radius,
$r_{\mathrm{dispersal}}$ (Extended Data Fig. 3; Methods).
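
The four sub-steps of one simulation step can be illustrated with a short
Python sketch. It is not the authors' MATLAB code. The phenotype table
`PHENOTYPES` (one assumed wiring of the cyclic network), the use of discs of a
given radius for the production, degradation and dispersal neighbourhoods, the
rule that every cell of the new grid is filled by a uniformly chosen survivor
within the dispersal radius, and all function names are assumptions made for
illustration only.

```python
import numpy as np

EMPTY = -1
# Assumed phenotype table: rows are species, columns are antibiotics.
# 'P' produces, 'S' is sensitive, 'D' degrades ('I' would be intrinsic resistance).
PHENOTYPES = np.array([["P", "D", "S"],
                       ["S", "P", "D"],
                       ["D", "S", "P"]])

def disc_offsets(radius):
    """Integer (dy, dx) offsets that lie within a disc of the given radius."""
    r = int(np.floor(radius))
    return [(dy, dx) for dy in range(-r, r + 1) for dx in range(-r, r + 1)
            if dy * dy + dx * dx <= radius * radius]

def dilate(mask, radius):
    """Mark every cell within `radius` of a True cell (wrapping boundaries)."""
    out = np.zeros_like(mask)
    for dy, dx in disc_offsets(radius):
        out |= np.roll(mask, (dy, dx), axis=(0, 1))
    return out

def kill(grid, phenotypes, r_production, r_degradation):
    """Empty the cells of sensitive species that sit in an undegraded antibiotic zone."""
    survivors = grid.copy()
    for a in range(phenotypes.shape[1]):
        producers = np.isin(grid, np.flatnonzero(phenotypes[:, a] == "P"))
        degraders = np.isin(grid, np.flatnonzero(phenotypes[:, a] == "D"))
        sensitive = np.isin(grid, np.flatnonzero(phenotypes[:, a] == "S"))
        zone = dilate(producers, r_production) & ~dilate(degraders, r_degradation)
        survivors[zone & sensitive] = EMPTY
    return survivors

def colonize(survivors, r_dispersal, rng):
    """Fill each cell of a new grid with a uniformly chosen survivor within r_dispersal."""
    new = np.full_like(survivors, EMPTY)
    seen = np.zeros(survivors.shape, dtype=int)
    for dy, dx in disc_offsets(r_dispersal):
        candidate = np.roll(survivors, (dy, dx), axis=(0, 1))
        alive = candidate != EMPTY
        seen += alive
        # Reservoir sampling: the k-th survivor seen replaces the choice with probability 1/k.
        take = alive & (rng.random(survivors.shape) * seen < 1.0)
        new[take] = candidate[take]
    return new

def step(grid, phenotypes, r_production, r_degradation, r_dispersal, rng):
    """One simulation step: production, degradation, killing, then colonization."""
    return colonize(kill(grid, phenotypes, r_production, r_degradation), r_dispersal, rng)

rng = np.random.default_rng(0)
grid = rng.integers(0, 3, size=(40, 40))      # hypothetical random initial seeding
for _ in range(20):
    grid = step(grid, PHENOTYPES, 3, 3, 4, rng)
print(np.bincount(grid[grid != EMPTY], minlength=3))   # cells occupied by each species
```

Replacing every `"D"` in the table with `"I"` gives the intrinsically resistant
variant of the same network, since the sketch treats any phenotype other than
`"P"`, `"S"` and `"D"` as inert.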

In these spatial inhibition-zone models, communities with intrinsic
resistance and with resistance through antibiotic degradation result in
dramatically different patterning and robustness to dispersal. We
simulated a simple three-species and three-antibiotic network that
exhibits cyclic dominance. Consistent with previous studies,
pairwise interactions among intrinsically resistant species result in
coexistence of all three species through single-species domains that
continually chase each other around the grid (Fig. 1d, left).
However, we found that three-way interactions created by antibiotic
degradation lead to tight intermixing of species (Fig. 1d, right). This
fine-scale intermixing allowed us to investigate whether a spatial
environment was necessary for the coexistence of these antibiotic-degrading
communities. Spatial structure is needed for coexistence in pairwise
antibiotic-interaction models, and diversity of the community with
intrinsic resistance collapses when the dispersal radius increases
(Fig. 1e, left; Supplementary Videos 1-5). In contrast, the
antibiotic-degrading community maintains diversity across any level of dispersal,
even with complete mixing between time steps (Fig. 1e, right;
Supplementary Videos 6-10).

We characterized the stability of these communities with respect
to model parameters by considering the analytical limit of an
infinitely large environment with complete mixing between time steps
($r_{\mathrm{dispersal}} \to \infty$; a ‘mixed inhibition-zone model’; Methods).
Considering different initial species abundances, the community with
intrinsically resistant species coexists only at a single unstable fixed
point; starting in any other initial condition leads to extinction of all
but one species (Fig. 2a, left). In contrast, the community with
antibiotic degradation coexists with a stable fixed point and a large basin of
attraction (Fig. 2a, right). Moreover, antibiotic degradation enables
stable communities to form despite large differences in inherent
species growth rates ($g_1$, $g_2$, $g_3$; Fig. 2b, right). Thus, even in environments
with complete mixing, the interplay of antibiotic production and
degradation leads to coexistence that is robust to large
perturbations of species abundances and substantial differences in inherent
growth rates.

Community stability is maximized at intermediate levels of
antibiotic degradation (Fig. 2c and Extended Data Fig. 4a). Levels of
degradation that are too high mostly eliminate the effects of the antibiotics,
after which inherent growth rate differences lead to one species taking
over. Levels of degradation that are too low do not allow coexistence, as
expected, since $K_D = 0$ is equivalent to intrinsic antibiotic resistance.
Stability is therefore maximal at intermediate levels of degradation
where a negative feedback can operate, such that increased abundance
of a given species results in its increased inhibition (Extended Data
Fig. 4b-d). In contrast, stability is maximal at high levels of
antibiotic production ($K_P$). Thus, strong antibiotic production can
stabilize diversity when combined with intermediate levels of antibiotic
degradation.

Production and degradation of multiple antibiotics also leads to 
robust coexistence in a well-mixed chemostat setting. We modelled 
the cyclic three-species, three-antibiotic interaction network using a 
single resource chemostat model. While these models are inherently 
unstable with intrinsic resistance, implementing resistance through 
enzymatic degradation of antibiotics allows coexistence of all species 
through stable fixed points or limit cycles (Methods). As in the inhibi- 
tion-zone model, stability occurs despite inherent growth rate differ- 
ences among species and was strengthened at high levels of production 
and intermediate levels of degradation (Extended Data Fig. 5a). We 
also observed robust coexistence for different forms of antibiotic 
inhibition functions and when inhibitory interactions are diffuse,
that is, when all species have finite sensitivity to each antibiotic 
(Extended Data Fig. 5b). This generality with respect to different mod- 
els, assumptions, and parameter values suggests that the counteraction 
of antibiotic production and degradation can support coexistence in a 
wide variety of settings. 

These three-species ecosystems are also stable with respect to chea- 
ters; that is, they resist invasion by species that gain a small growth 
advantage by ceasing production or degradation of antibiotics. We 
consider two types of cheating species: those that stop producing anti- 
biotics and revert to intrinsic resistance while enjoying the production 
of antibiotics by their parental species, and those that stop degrading 
antibiotics to become antibiotic sensitive, yet still enjoy protection by 
their parental species (Fig. 3a). Starting with a stable three-species 
community, we simulated potential cheater invasion by adding a small 
amount of a cheater species derived from one of the existing species. 
We calculated the final abundance of a cheater once the community 
reached steady state while varying the growth advantage of the cheater 
compared to its parent: $g_{\mathrm{cheater}} = (1 + \varepsilon) g_{\mathrm{parent}}$. We assume that each
species has a different inherent growth rate, $g_1 < g_2 < g_3$. For small


Figure 1 | Replacing intrinsic antibiotic resistance with degradation-based
resistance generates community robustness to species dispersal. a, Pairwise
interactions among antibiotic-producing (P), -sensitive (S) and intrinsically
resistant ($R_I$) species are compared to three-way interactions where the sensitive
species can be protected by an antibiotic-degrading species ($R_D$). b, Images of a
YFP-labelled probe Escherichia coli strain (S, yellow-green) growing in the
presence of two Streptomyces colonies. The inhibition of the probe strain by a
producer (dark area around P) is unaffected by an intrinsically resistant
species ($R_I$, left), but is strongly attenuated around an antibiotic-degrading species
(right, yellow-green halo around $R_D$ colony). Images are representative of cases
without and with three-way interactions among 54 Streptomyces pairs tested
(1 replica, Extended Data Figs 1 and 2). c, Spatial inhibition-zone model. A
producer (P) kills nearby species that are sensitive to its antibiotic (crossed out S),
but does not affect resistant species ($R_I$, $R_D$). Sensitive species are protected by
degrader species (right, S within dashed circle around $R_D$). Surviving species then
replicate and disperse over distance $r_{\mathrm{dispersal}}$. d, Snapshots of spatial simulations
for cyclical three-species, three-antibiotic interaction networks (labels indicate
species phenotypes for each antibiotic). e, Intrinsically resistant communities
collapse with increased dispersal (left), while communities with antibiotic
degradation are robust to any level of dispersal (right). Insets show typical
sub-region snapshots. Number of species is based on average Shannon diversity of
sub-regions (Methods). $r_{\mathrm{dispersal}}$ is measured in grid units (where individual grid
cells have side length 1).

Figure 2 | Strong antibiotic production with intermediate levels of
degradation leads to stable communities robust to initial conditions and
substantial differences in species growth rates. a, Communities with intrinsic
resistance diverge from an unstable fixed point (green dot, left), while
communities with antibiotic degradation converge towards a stable fixed point
within a large basin of attraction (green area, right). Shown in trilinear
coordinates are abundance changes per generation (white arrows) and example
trajectories (black) for species with inherent growth rates: $g_1 < g_2 < g_3$.

b, Communities with degradation remain stable even for large differences in 


$\varepsilon$, the only cheater that can invade the community is the production
cheater derived from the fastest growing strain (Fig. 3b). This cheater 
replaces its parent, forming a new community of three species inter- 
acting through only two antibiotics. Coexistence in this simpler com- 
munity is robust to parameter variation (Extended Data Fig. 6) and is 
also evolutionarily stable: it does not allow invasion by cheaters arising 
from any of its species (Extended Data Fig. 7). All other cheaters in the 

Figure 3 | Communities are robust to invasion by cheaters that cease
antibiotic production or degradation. a, Production or degradation cheaters
(open circles with solid or dashed lines, respectively) are introduced at low
abundance into a stable three-species community (filled coloured circles).
b, When cheater growth advantage $\varepsilon$ is small, only the production cheater of the
fastest growing species can invade (species 3, green). This cheater replaces its
parent, forming a new stable community, which is in turn robust to further
invasions. The other two production cheaters can invade only when $\varepsilon$ is large
enough, taking over the community (yellow) or replacing their parent in a
stable community (red). c, Degradation cheaters cannot invade the community
at low $\varepsilon$, whereas at high enough $\varepsilon$ they can invade without replacing their
parents, rather generating a stable four-species community. A four-species
interaction network is shown for the species 2 degradation cheater: grey
interactions are inherited from the parent species; yellow interactions are
specific to the cheater. Shaded areas span lower/upper bounds when the
community exhibits sustained oscillations.

inherent species growth rates (right), while communities with intrinsic
resistance are always unstable (left). c, Maximal community stability occurs at
high levels of antibiotic production ($K_P$) and intermediate levels of degradation
($K_D$). Shading shows rate of exponential convergence (green) or escape (grey)
from the fixed point. Black lines in panels b and c show the stability region
boundary based on linear stability analysis (Methods). Units for production
and degradation strength are species$^{-1}$. $\tau_{\mathrm{generation}}$ is the number of generations
taken to get closer/further to the fixed point by a factor of 10.


three-species, three-antibiotic community cannot invade, even with a
small growth advantage (that is, for small positive $\varepsilon$ their final
abundance is zero; Fig. 3c). With higher growth advantages, these cheaters
can ultimately invade by dominating the community, by replacing the 
parent in a newly formed three-species community, or by generating a 
new four-species community (Fig. 3c). The emergence of these four- 
species communities motivated us to search for more complex ecosys- 
tems with greater numbers of species and to explore their dynamic 
behaviours. 

Analysis of more complex networks identified many ways for anti- 
biotic production and degradation to generate stability. Randomly 
sampling larger networks with up to six species and five antibiotics, 
we found many stable community topologies, all of which included 

Figure 4 | Complex interaction networks support coexistence through
diverse dynamic behaviours. An example of a stable network with four species 
interacting through production and degradation of three antibiotics. 
Depending on the strengths of antibiotic production and degradation, this 
community can coexist through stable equilibrium, limit cycles or chaos 
(Methods). Line colours indicate species identity. 

antibiotic-degrading species (Extended Data Fig. 8). Notably, greater
numbers of antibiotics generally increase the number of species that 
can coexist. While the fraction of network topologies that support stable 
communities can be small, the combinatorial increase in the number of 
possible networks provides multitudes of ways for antibiotic production 
and degradation to generate stable communities with large numbers of 
species. Coexistence among species in these various network topologies 
can occur through different dynamic behaviours, including stable fixed 
points, limit cycles, or chaos, depending on the strengths of antibiotic 
production $K_P$ and degradation $K_D$ (Fig. 4 and Extended Data Fig. 9).
Thus, in addition to enabling the long-term coexistence of complex 
communities, the combined effect of antibiotic production and degra- 
dation can dramatically impact community dynamics. 

Our findings suggest new possibilities for engineering multi-species
microbial consortia and shed light on the role of antibiotic production
and degradation in maintaining biodiversity within natural microbial
communities. Of course, natural microbial ecosystems are extremely
complicated; they contain orders of magnitude more species than we
modelled and include additional interactions at the level of resource 
competition, metabolic cross-feeding, phage invasion, and predator- 
prey relationships. We expect that further insights into ecosystem 
stability and assembly will emerge by understanding how these mechan- 
isms generate both pairwise and higher-order multi-species interactions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


Mixed inhibition-zone model of cyclic three-species communities. Model
description. We derived a formula for the changes of species abundances in the
inhibition-zone model for the limit of complete mixing after each time step
($r_{\mathrm{dispersal}} \to \infty$) and infinite population size. Let $X_i$ be the abundance of species $i$
per unit area, $K_P$ the killing area around each producer and $K_D$ the degradation
area around each degrader. Since species disperse randomly, the fitness $f_i$ of
species $i$ depends on its inherent growth rate $g_i$ and the fraction of area $1 - P_{\mathrm{kill}}$
in which it is not killed by antibiotics:

$$f_i = g_i \left(1 - P_{\mathrm{kill}}\right).$$

For each antibiotic, a sensitive individual S will only be killed if it is located where
there is at least one producer P within a $K_P$ neighbourhood and there is no
protecting antibiotic-degrading ($R_D$) species within a $K_D$ neighbourhood. Since
individuals are randomly placed, the number of cells within such neighbourhoods
follows a Poisson distribution, so that if $\lambda$ is the expected number of cells in a
region then the probability of having zero cells in the region is $e^{-\lambda}$. Thus the
probability of being in the killing zone of the producer is $1 - e^{-K_P X_P}$, and the
probability of not being in the degradation zone of $R_D$ is $e^{-K_D X_D}$, where $X_P$ and
$X_D$ are respectively the abundances of species producing and degrading the
antibiotic. Combining these we obtain

$$P_{\mathrm{kill}} = e^{-K_D X_D} \left(1 - e^{-K_P X_P}\right).$$


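Thus for the cyclic three-species network the fitness of each species is:

$$f_i = g_i \left(1 - e^{-K_D X_{D(i)}} \left(1 - e^{-K_P X_{P(i)}}\right)\right), \qquad i = 1, 2, 3,$$

where $X_{P(i)}$ and $X_{D(i)}$ are the abundances of the species that respectively
produce and degrade the antibiotic to which species $i$ is sensitive.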


We update species abundance for each time step using the discrete time formula:

$$X_i(t+1) = \frac{X_i(t) f_i(t)}{\sum_j X_j(t) f_j(t)},$$

where the denominator ensures that total species abundance remains constant.
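
As an illustration, the fitness and update formulas above can be iterated in a
few lines of Python. This is a minimal sketch rather than the authors' MATLAB
code. The index arrays `PRODUCER` and `DEGRADER` encode one assumed wiring of
the cyclic network (species $i$ is sensitive to the antibiotic made by species
`PRODUCER[i]` and is protected by species `DEGRADER[i]`), and the function
names are hypothetical.

```python
import numpy as np

# Assumed cyclic wiring (0-based species indices), chosen for illustration only.
PRODUCER = np.array([2, 0, 1])   # species that produces the antibiotic killing species i
DEGRADER = np.array([1, 2, 0])   # species that degrades that same antibiotic

def p_kill(x, k_p, k_d, producer=PRODUCER, degrader=DEGRADER):
    """Probability that an individual of each species is killed in one time step."""
    x = np.asarray(x, dtype=float)
    in_kill_zone = 1.0 - np.exp(-k_p * x[producer])   # at least one producer nearby
    unprotected = np.exp(-k_d * x[degrader])          # no degrader nearby
    return unprotected * in_kill_zone

def step(x, g, k_p, k_d, producer=PRODUCER, degrader=DEGRADER):
    """One discrete time step: fitness-weighted abundances, renormalized to sum to 1."""
    x = np.asarray(x, dtype=float)
    fitness = np.asarray(g, dtype=float) * (1.0 - p_kill(x, k_p, k_d, producer, degrader))
    weighted = x * fitness
    return weighted / weighted.sum()

x = np.array([0.5, 0.3, 0.2])
g = np.array([1.0, 1.1, 1.2])            # growth rates quoted for Fig. 2a
for _ in range(500):
    x = step(x, g, k_p=12.0, k_d=3.0)    # K_P and K_D quoted for Fig. 2a, right
print(x)
```

Setting `k_d=0.0` removes the protective term, which corresponds to the
intrinsically resistant community.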
Linear stability analysis. We found the fixed point numerically for each set of 
parameters and then found the eigenvalues of the Jacobian computed at the fixed 
point. The logarithm of the maximum absolute value of the eigenvalues was used 
as a stability measure in Fig. 2b, c to determine the region of stability. The fixed 
point is stable if this value is less than zero. 
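
The stability measure can be sketched numerically as follows. The sketch
assumes the same hypothetical cyclic wiring (`PRODUCER`, `DEGRADER`) as an
illustration, locates the fixed point with Newton's method starting from equal
abundances, and approximates the Jacobian by central finite differences. The
source does not say which root finder or differentiation scheme was used, so
both are assumptions, as is the use of the natural logarithm (the base does not
affect the sign).

```python
import numpy as np

PRODUCER = np.array([2, 0, 1])   # assumed cyclic wiring, as an illustration
DEGRADER = np.array([1, 2, 0])

def step(x, g, k_p, k_d):
    """Mixed inhibition-zone update for the cyclic three-species network."""
    x = np.asarray(x, dtype=float)
    p_kill = np.exp(-k_d * x[DEGRADER]) * (1.0 - np.exp(-k_p * x[PRODUCER]))
    weighted = x * np.asarray(g, dtype=float) * (1.0 - p_kill)
    return weighted / weighted.sum()

def jacobian(fun, x, h=1e-6):
    """Central finite-difference Jacobian of `fun` at `x`."""
    x = np.asarray(x, dtype=float)
    jac = np.empty((x.size, x.size))
    for k in range(x.size):
        dx = np.zeros_like(x)
        dx[k] = h
        jac[:, k] = (fun(x + dx) - fun(x - dx)) / (2.0 * h)
    return jac

def find_fixed_point(g, k_p, k_d, n_newton=50):
    """Newton iteration on step(x) - x = 0, started from equal abundances."""
    fun = lambda x: step(x, g, k_p, k_d)
    x = np.full(len(g), 1.0 / len(g))
    for _ in range(n_newton):
        residual = fun(x) - x
        if np.abs(residual).max() < 1e-13:
            break
        x = x - np.linalg.solve(jacobian(fun, x) - np.eye(x.size), residual)
    return x

def stability_measure(g, k_p, k_d):
    """Log of the largest |eigenvalue| of the Jacobian at the fixed point."""
    x_star = find_fixed_point(g, k_p, k_d)
    eigenvalues = np.linalg.eigvals(jacobian(lambda x: step(x, g, k_p, k_d), x_star))
    return np.log(np.abs(eigenvalues).max()), x_star

for k_p, k_d in [(4.0, 0.0), (12.0, 3.0)]:      # values quoted for Fig. 2a
    measure, x_star = stability_measure([1.0, 1.1, 1.2], k_p, k_d)
    print(k_p, k_d, x_star, measure)            # measure < 0 indicates a stable fixed point
```

Because the update keeps total abundance constant, the Jacobian always has one
eigenvalue equal to zero, which does not influence the maximum.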
Robustness to mutation and invasion by cheating species. Each cheating species
was cloned from a parent by copying all antibiotic phenotypes except for a
mutation in the phenotype with respect to a single antibiotic: $P \to R_I$ for production
cheaters, and $R_D \to S$ for degradation cheaters. Starting with a three-species
community at equilibrium, we added a small amount of the cheating species
($X_{\mathrm{cheater}} = 0.001$, with abundances of the other species renormalized so that
$\sum_i X_i = 1$). We set
the growth rate of the cheater to the parental growth rate with a growth advantage
$\varepsilon$, so that $g_{\mathrm{cheater}} = (1 + \varepsilon) g_{\mathrm{parent}}$. We ran each simulation until species abundances
reached steady state or stable oscillations, and then we calculated the minimum
and maximum values for each cheater during the last third of the simulation time.
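
The bookkeeping of this invasion protocol can be sketched in Python as below.
The helper names are hypothetical, phenotypes are encoded as the letters
`"P"`, `"S"`, `"I"` ($R_I$) and `"D"` ($R_D$), and the dynamics are passed in as
a callable. The `placeholder_step` used in the demonstration contains no
antibiotic interactions at all; it is only a stand-in so that the sketch runs
on its own and should be replaced by the model of interest.

```python
import numpy as np

def cheater_phenotype(phenotypes, parent, antibiotic):
    """Copy the parent's phenotypes, mutating P -> R_I or R_D -> S for one antibiotic."""
    mutation = {"P": "I", "D": "S"}
    row = list(phenotypes[parent])
    if row[antibiotic] not in mutation:
        raise ValueError("parent neither produces nor degrades this antibiotic")
    row[antibiotic] = mutation[row[antibiotic]]
    return row

def add_cheater(x, g, parent, epsilon, x_cheater=0.001):
    """Append the cheater and renormalize the residents so that abundances sum to 1."""
    x = np.asarray(x, dtype=float)
    g = np.asarray(g, dtype=float)
    residents = x / x.sum() * (1.0 - x_cheater)
    new_x = np.append(residents, x_cheater)
    new_g = np.append(g, (1.0 + epsilon) * g[parent])   # g_cheater = (1 + eps) g_parent
    return new_x, new_g

def cheater_range(step_fn, x, g, n_steps):
    """Iterate the dynamics; return (min, max) cheater abundance over the last third."""
    history = np.empty(n_steps)
    for t in range(n_steps):
        x = step_fn(x, g)
        history[t] = x[-1]                               # the cheater is the last entry
    tail = history[-(n_steps // 3):]
    return tail.min(), tail.max()

def placeholder_step(x, g):
    """Stand-in dynamics without antibiotics: fitness equals the growth rate."""
    weighted = x * g
    return weighted / weighted.sum()

phenotypes = [["P", "D", "S"], ["S", "P", "D"], ["D", "S", "P"]]   # assumed wiring
print(cheater_phenotype(phenotypes, parent=2, antibiotic=2))       # production cheater
x, g = add_cheater([1 / 3, 1 / 3, 1 / 3], [1.0, 1.1, 1.2], parent=2, epsilon=0.05)
print(cheater_range(placeholder_step, x, g, n_steps=3000))
```

Reporting both the minimum and the maximum over the final third distinguishes a
steady state (the two values coincide) from sustained oscillations.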
Coexistence of communities with three species and two antibiotics. Cyclic 
three-species communities maintain coexistence even when the fastest growing 
species ceases antibiotic production, resulting in a simpler three-species, two- 
antibiotic community. This network exhibits robust coexistence similar to the 
three-antibiotic network, so long as the species that does not produce antibiotics 
is the fastest growing species (Extended Data Fig. 6a). This simpler community 
also resists invasion by any remaining production or degradation cheaters 
(Extended Data Fig. 7b). 

We investigated whether further reductions of this network support coexist- 
ence. While coexistence is possible when only one of the two antibiotics is 
degraded, this coexistence occurs only for a small range of parameter values 
(Extended Data Fig. 6b). Based on a comprehensive sampling of parameter space, 
we found that further reductions to communities that produce only a single 
antibiotic do not support coexistence when inherent growth rates differ. This is 
consistent with the intuition that when only one antibiotic is being produced there 
will always be two species that are equivalent with respect to the antibiotic, which 
will lead to extinction of the slower-growing species. Thus, in contrast to spatial 
models where coexistence is possible on a single antibiotic, coexistence in
well-mixed models requires three species and at least two antibiotics.

Generalizing the mixed inhibition-zone model for arbitrary interaction
networks. Model description. We calculated the probability of each species being
inhibited for a general ecosystem with $n_s$ species and $n_a$ antibiotics, where species
can be any of the four antibiotic phenotypes (P, S, $R_I$ or $R_D$) for each antibiotic. The
assumptions are the same as for the mixed inhibition-zone model of cyclic
three-species communities. Consider the situation in which antibiotic production is
stronger than degradation ($K_P > K_D$; the other scenario follows similarly). For
an individual of a target sensitive species, each of the other $n_s - 1$ species can be
either near enough to affect the target through production and degradation of
some of the antibiotics (with probability $P_{\mathrm{near}}$), can be at intermediate distances in
which they affect the target through production but not degradation ($P_{\mathrm{intermediate}}$),
or can be far away so that they do not affect the target in any way ($P_{\mathrm{far}}$). These three
distance ranges make for $3^{n_s - 1}$ possible combinations for the presence or absence
of the other species within any neighbourhood of the target species. Based on the
phenotypes of the sensitive target species and the other species, we calculated all
combinations that result in antibiotic inhibition. For each species $i$, we calculated
the probabilities $P_{\mathrm{near}}$, $P_{\mathrm{intermediate}}$ and $P_{\mathrm{far}}$ from the levels of production and
degradation of the antibiotics, in this case:


$$P_{\mathrm{near}} = 1 - e^{-K_D X_i}$$

$$P_{\mathrm{far}} = e^{-K_P X_i}$$

$$P_{\mathrm{intermediate}} = 1 - P_{\mathrm{far}} - P_{\mathrm{near}}$$


Combining these, we calculated the probability $P_c$ of each inhibitory combination
occurring, multiplying the probabilities of each species being either near,
intermediate, or far according to the requirements of each combination. Finally, we
obtained the total probability of the target species being inhibited by summing
across all possible inhibitory combinations with $P_{\mathrm{kill}} = \sum_c P_c$. As in the
three-species model we let the fitness of each species be:

$$f_i = g_i \left(1 - P_{\mathrm{kill},i}\right)$$

and

$$X_i(t+1) = \frac{X_i(t) f_i(t)}{\sum_j X_j(t) f_j(t)}.$$
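
The enumeration over near, intermediate and far combinations can be sketched in
Python as follows. This is an illustrative reading of the description above,
not the authors' implementation: phenotypes are encoded as `"P"`, `"S"`, `"I"`
($R_I$) and `"D"` ($R_D$), only the case $K_P > K_D$ is covered, and a
combination is counted as inhibitory when, for at least one antibiotic to which
the target is sensitive, some producer is near or intermediate and no degrader
is near.

```python
import itertools
import numpy as np

NEAR, MID, FAR = 0, 1, 2

def zone_probabilities(x, k_p, k_d):
    """Rows: probability that each species is near, intermediate or far (needs K_P > K_D)."""
    near = 1.0 - np.exp(-k_d * x)
    far = np.exp(-k_p * x)
    return np.stack([near, 1.0 - near - far, far])

def p_kill(target, x, phenotypes, k_p, k_d):
    """Total probability that an individual of species `target` is inhibited."""
    if k_p <= k_d:
        raise ValueError("this sketch only covers K_P > K_D")
    x = np.asarray(x, dtype=float)
    n_antibiotics = len(phenotypes[0])
    others = [s for s in range(len(phenotypes)) if s != target]
    probs = zone_probabilities(x, k_p, k_d)
    total = 0.0
    for zones in itertools.product((NEAR, MID, FAR), repeat=len(others)):
        inhibited = False
        for a in range(n_antibiotics):
            if phenotypes[target][a] != "S":
                continue
            produced = any(phenotypes[s][a] == "P" and z != FAR
                           for s, z in zip(others, zones))
            degraded = any(phenotypes[s][a] == "D" and z == NEAR
                           for s, z in zip(others, zones))
            if produced and not degraded:
                inhibited = True
                break
        if inhibited:
            total += np.prod([probs[z, s] for s, z in zip(others, zones)])
    return total

# Assumed wiring of the cyclic three-species, three-antibiotic network.
cyclic = [["P", "D", "S"],
          ["S", "P", "D"],
          ["D", "S", "P"]]
x = np.array([0.5, 0.3, 0.2])
print([p_kill(i, x, cyclic, k_p=12.0, k_d=3.0) for i in range(3)])
```

For the cyclic network the sum collapses to the closed form
$e^{-K_D X_D}(1 - e^{-K_P X_P})$ of the three-species model, which provides a
simple consistency check.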


Testing for coexistence in more complex networks. We randomly sampled up to 
10° networks for each combination of 4-6 initial species and 1-5 initial antibiotics 
by randomly choosing one of the four phenotypes (S, P, $R_I$ or $R_D$) for each species/
antibiotic combination. When the total number of possible networks was less than 
10° we tested all networks. We excluded networks that contained antibiotics or 
species with identical properties from subsequent simulations, since these are 
equivalent to instances of networks with a smaller number of antibiotics or species. 
With $n_a$ initial antibiotics and $n_s$ initial species at initial abundances $1/n_s$, we
calculated the number of final species surviving at abundance $X_i > 0.01/n_s$ after
10,000 time steps. Simulations were binned by the maximum number of surviving
species across all possible combinations of the parameters: $g_{\max}$ = (1.2 or 2.5),
$K_P$ = (10 or 30) and $K_D$ = (3 or 10). For a given $g_{\max}$, species growth rates
were evenly spaced between 1.0 and $g_{\max}$. With total number of network
topologies nota = 4" /na!, even this sparse sampling of parameters shows that 
the total number of topologies that support stability is quite large (Extended 
Data Fig. 8). 
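
The sampling and filtering steps can be sketched in Python as below. The
sketch is illustrative only: the function names are hypothetical, phenotypes
are encoded as `"S"`, `"P"`, `"I"` ($R_I$) and `"D"` ($R_D$), and ‘identical
properties’ is interpreted as two species (rows) or two antibiotics (columns)
with identical phenotype patterns, which is an assumption.

```python
import numpy as np

PHENOTYPE_CODES = np.array(["S", "P", "I", "D"])

def sample_network(n_species, n_antibiotics, rng):
    """Draw one phenotype per species/antibiotic combination, uniformly at random."""
    return rng.choice(PHENOTYPE_CODES, size=(n_species, n_antibiotics))

def has_redundancy(network):
    """True if two species (rows) or two antibiotics (columns) are identical."""
    rows = {tuple(row) for row in network}
    cols = {tuple(col) for col in network.T}
    return len(rows) < network.shape[0] or len(cols) < network.shape[1]

def initial_state(n_species, g_max):
    """Equal initial abundances 1/n_s and growth rates evenly spaced in [1, g_max]."""
    return np.full(n_species, 1.0 / n_species), np.linspace(1.0, g_max, n_species)

def count_survivors(x, n_species):
    """Number of species whose final abundance exceeds 0.01 / n_s."""
    return int(np.sum(np.asarray(x, dtype=float) > 0.01 / n_species))

rng = np.random.default_rng(0)
kept = []
while len(kept) < 5:
    network = sample_network(4, 3, rng)
    if not has_redundancy(network):
        kept.append(network)
print(kept[0])
print(initial_state(4, 2.5))
```

Each retained network would then be simulated for every parameter combination
and scored with `count_survivors` after the final time step.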

Calculating effective number of species using Shannon diversity. We calculated 
effective species numbers by the Shannon entropy H of the species distribution: 


$$H = -\sum_i X_i \log_2 X_i$$

with species frequencies $X_i$ normalized such that $\sum_i X_i = 1$. Effective number of
species is then given by $2^H$.
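
A minimal Python sketch of this calculation is given below. The function name
is hypothetical, and the convention that absent species contribute nothing to
the entropy ($0 \log 0 = 0$) is an assumption that the text does not state
explicitly.

```python
import numpy as np

def effective_species(abundances):
    """Effective number of species, 2**H, with H the base-2 Shannon entropy."""
    x = np.asarray(abundances, dtype=float)
    x = x / x.sum()                     # normalize so that frequencies sum to 1
    present = x[x > 0]                  # treat 0 * log2(0) as 0
    entropy = -np.sum(present * np.log2(present))
    return 2.0 ** entropy

print(effective_species([10, 10, 10]))   # three equally abundant species
print(effective_species([25, 0, 0]))     # a single surviving species
```

Equal abundances of $n$ species give exactly $n$ effective species, and any
unevenness lowers the value towards 1.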


Chemostat model of cyclic three-species communities. Model description. Our
chemostat model is adapted from classical models of antibiotic interactions with a
constant inflow of a single resource and constant dilution. Ordinary differential
equations track the concentration of the resource ($Z$), the abundances of three
species ($X_i$), and the concentrations of three antibiotics ($C_i$), implementing the
cyclical interaction network as described previously. Each species consumes
resources and produces antibiotics. The species degrade antibiotics to which they
are resistant, and their growth is inhibited by antibiotics to which they are
sensitive. The resource dynamic is:


$$\frac{dZ}{dt} = (Z_0 - Z) D - \sum_i \frac{X_i G_i}{m},$$


where $Z_0$ is the concentration of the added resource, $D$ is the dilution rate, $G_i$ is a
function that describes the growth rate of species $i$ considering antibiotic and
resource levels, and $m$ is a conversion factor between resources and species,
so that $G_i/m$ is the resource consumption rate of each species. The species
abundances change according to:

$$\frac{dX_i}{dt} = X_i (G_i - D) \quad \text{with} \quad G_i = g_i \frac{Z}{Z + k_Z} e^{-K_P C_i}.$$


Growth rate increases with the concentration of resources following Monod
kinetics, where $k_Z$ determines the concentration of half-maximal resource absorption.
The growth rate is also affected by antibiotics, decreasing exponentially with the
level of antibiotics to which a species is sensitive (here an antibiotic with
concentration $C_i$ inhibits species $i$). The parameter $K_P$ characterizes the strength of
antibiotic inhibition, and $g_i$ is a species-specific maximal growth rate.

The antibiotic concentration dynamics are given by: 


$$\frac{dC_i}{dt} = p X_j G_j - K_D X_k C_i - D C_i.$$

The parameter $p$ determines the amount of antibiotics produced per cell division,
the second term assumes that species $k$ degrades the antibiotic produced by species
$j$ through mass action kinetics, and the parameter $K_D$ characterizes the strength of
antibiotic degradation.
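
The three sets of equations can be integrated numerically as in the following
Python sketch. It is an illustration rather than the authors' MATLAB code: the
cyclic wiring (`PRODUCER`, `DEGRADER`), every parameter value, the symbol
`k_z` for the half-saturation constant and the fixed-step fourth-order
Runge-Kutta integrator are all assumptions chosen so that the example runs
without external dependencies beyond NumPy.

```python
import numpy as np

# Assumed wiring: antibiotic i inhibits species i, is produced by species
# PRODUCER[i] (index j in the text) and degraded by species DEGRADER[i] (index k).
PRODUCER = np.array([2, 0, 1])
DEGRADER = np.array([1, 2, 0])

def derivatives(state, g, z0, dilution, m, k_z, k_p, k_d, p):
    """Right-hand side for the state vector [Z, X_1..X_3, C_1..C_3]."""
    z, x, c = state[0], state[1:4], state[4:7]
    growth = g * z / (z + k_z) * np.exp(-k_p * c)              # G_i
    dz = (z0 - z) * dilution - np.sum(x * growth) / m
    dx = x * (growth - dilution)
    dc = p * x[PRODUCER] * growth[PRODUCER] - k_d * x[DEGRADER] * c - dilution * c
    return np.concatenate(([dz], dx, dc))

def rk4_step(state, dt, *params):
    """One classical fourth-order Runge-Kutta step."""
    k1 = derivatives(state, *params)
    k2 = derivatives(state + 0.5 * dt * k1, *params)
    k3 = derivatives(state + 0.5 * dt * k2, *params)
    k4 = derivatives(state + dt * k3, *params)
    return state + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

def simulate(state, t_end, dt, *params):
    """Integrate from t = 0 to t_end with a fixed time step."""
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(state, dt, *params)
    return state

# Hypothetical parameter values: g, Z_0, D, m, k_Z, K_P, K_D, p.
params = (np.array([1.5, 1.6, 1.7]), 1.0, 1.0, 1.0, 0.1, 5.0, 5.0, 1.0)
state0 = np.concatenate(([1.0], np.full(3, 0.01), np.zeros(3)))
final = simulate(state0, 30.0, 0.01, *params)
print("Z =", final[0], "X =", final[1:4], "C =", final[4:7])
```

A useful sanity check on any implementation is that the combination
$\sum_i X_i/m - Z_0 + Z$ decays as $e^{-Dt}$ regardless of the other
parameters, which follows from summing the resource and species equations.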

Rescaled chemostat model. To analyze the stability of the three-species
three-antibiotic interaction network, we first note that

$$\frac{d}{dt}\left(\sum_i \frac{X_i}{m} - Z_0 + Z\right) = -D \left(\sum_i \frac{X_i}{m} - Z_0 + Z\right),$$

and therefore after a transient

$$Z = Z_0 - \sum_i \frac{X_i}{m},$$

which allows us to eliminate the resource concentration degree of freedom.

We rescale variables to reduce the number of free parameters: time so that $D = 1$,
$Z$ and $k_Z$ so that $Z_0 = 1$, and we eliminate $m$ by substituting $X_i' = X_i/m$, $C_i' = C_i/m$,
$K_P' = m K_P$ and $K_D' = m K_D$. Dropping the primes we obtain:

$$\frac{dX_i}{dt} = X_i (G_i - 1),$$

$$G_i = g_i \frac{1 - \sum_l X_l}{1 - \sum_l X_l + k_Z} e^{-K_P C_i},$$

$$\frac{dC_i}{dt} = p X_j G_j - K_D X_k C_i - C_i.$$

Linear stability analysis. The problem of finding the fixed point in the rescaled
model was reduced to a single nonlinear algebraic equation, which was
numerically solved for different parameters to determine the corresponding equilibria
for $X_i$ and $C_i$. At the fixed point $dX_i/dt = 0$, and therefore $G_i = 1$. This allows us to
express the $C_i$ variables as functions of $\Theta$ and the $g_i$ constants. Substituting the
expressions for $C_i$ into $dC_i/dt = 0$
results in a system of linear equations for $X_i$ terms, which can be solved as a
function of $\Theta$. This allows us to express $\sum_i X_i$ as a function of $\Theta$ and thus obtain
a nonlinear equation for $\Theta$. For Extended Data Fig. 5a, the stability of the fixed
point was determined by finding the eigenvalues of the Jacobian at the fixed point.
A negative largest real component of the eigenvalues indicates stability.
Parameter values for main text figures. Fig. 1: grids were 200 × 200 with
wrapping boundaries. Species were initially randomly seeded on the grid at low
total abundance (1%), or in an initial configuration with three large domains for
$r_{\mathrm{dispersal}}$ > 20 to reduce the initial chance of extinction of all species at high levels of
mixing (with intrinsic resistance). Fig. 1e, lines show average of ten simulations,
calculating the effective species numbers based on Shannon diversity of species
within 40 × 40 sub-regions (shown in insets) at 100 evenly spaced locations on the
grid at the final time; other parameters: $r_{\mathrm{production}}$ = 35 $r_{\mathrm{degradation}}$ = 35 $r_{\mathrm{dispersal}}$ =
(3, 4, 6, 10, 20, 80, 200); runtimes for each $r_{\mathrm{dispersal}}$ are respectively: (150, 150, 75,
75, 75, 25, 25), after which the overall spatial patterns were relatively unchanging.
For intrinsic resistance with $r_{\mathrm{dispersal}}$ = (80, 200) it was possible for all species to be
inhibited at the same time step, in which case we chose one cell at random to
proceed to the next generation.

Fig. 2: we estimated the size of the basin of attraction by running simulations
starting from 210 different initial species abundances that were equally spaced on a
triangle lattice inside the $x_1 + x_2 + x_3 = 1$ two-dimensional simplex. The basin of
attraction area is the fraction of simulations that move to the fixed point with all
three species coexisting. Fig. 2a, $g_1$ = 1.0, $g_2$ = 1.1, $g_3$ = 1.2; left: $K_P$ = 4, $K_D$ = 0;
right: $K_P$ = 12, $K_D$ = 3. Fig. 2b, left: $K_P$ = 4, $K_D$ = 0; right: $K_P$ = 40, $K_D$ = 4. Fig. 2c,
2 = 1, = 3, g3 = 2. 

Fig. 3: $g_1$ = 1.0, $g_2$ = 1.1, $g_3$ = 1.2, $K_P$ = 16, $K_D$ = 4.

Fig. 4: $g_1$ = 1.0, $g_2$ = 1 + 1/30, $g_3$ = 1 + 2/30, $g_4$ = 1.1; stable equilibrium: $K_P$ =
10, $K_D$ = 10; limit cycles: $K_P$ = 12, $K_D$ = 20; chaos: $K_P$ = 20, $K_D$ = 55.
Code availability. MATLAB code for the spatial inhibition-zone model, mixed 
inhibition-zone model and chemostat model is available from the authors upon 
request. 
Note on the mechanism of three-way attenuating interactions. Antibiotic
degradation is a common mechanism of antibiotic resistance and is therefore
a probable mechanism for the attenuation of inhibition that we observed in the
three-species interaction assays and the antibiotic modulation assays (Extended
Data Figs 1 and 2). There is strong evidence in support of this assumption based on
many reports of antibiotic-degrading species protecting sensitive species from
antibiotics. Antibiotic degradation is also a known mechanism of
resistance for the antibiotics in which we observed significant attenuation (Extended
Data Fig. 1c): tobramycin, ciprofloxacin, β-lactam antibiotics (cefoxitin,
piperacillin, penicillin), nitrofurantoin, rifampin and trimethoprim.
Furthermore, we experimentally confirmed that degradation through
β-lactamases contributes to the attenuation of β-lactam antibiotics through a modified
version of the antibiotic modulation assays that uses β-lactamase inhibitors
(Extended Data Figs 1d and 2c). However, we note that the models we use can
be easily generalized to additional mechanisms that generate antibiotic
attenuation, for example through antibiotic suppression.
Interaction experiments. No statistical methods were used to predetermine
sample size. Three-species interaction assays. All possible pairings of 16 Streptomyces
species from a spore collection (~100 spores per µl) were pinned 1 cm apart on
9-cm Petri dishes with 1× oatmeal agar, and plates were grown in a darkened
plastic bin for 13 days at room temperature. Overlay media without thiamine was
prepared in test tubes in 5-ml aliquots, melted in an autoclave (to melt, not
sterilize), and cooled to 42 °C in a water bath. An overnight liquid-broth culture
of E. coli expressing YFP was mixed 9:1 with 1,000 thiamine stock, 50 µl of the E.
coli mixture was added to overlay agar, quickly vortexed to mix and then poured
uniformly onto the plate. Plates were incubated at 37 °C for 18 h and
photographed in bright field and YFP channels to record the antibiotic inhibition zone.
Combinations in which one species created a large inhibition zone were scored
using the modulation index $M = (r_m - r_0)/(r_m + r_0)$, where $r_m$ is the radius of the
inhibition zone in the direction of the modulator and $r_0$ is the radius of the
inhibition zone in the opposite direction. Extended Data Fig. 1b shows data from
54 combinations in which one species made a large inhibition zone, with five 
antibiotic producers and 11 modulator species (one plate removed due to con- 
tamination). Extended Data Fig. 2a shows images and scoring. We observed 
similar attenuation in follow-up technical replicas of individual strain combina- 
tions and when testing on other media. 
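
For concreteness, the modulation index can be computed as in the short Python
sketch below. The function name and the example radii are hypothetical. Under
this definition a zone that shrinks towards the modulator ($r_m < r_0$,
attenuation) gives a negative $M$, a zone that grows towards it gives a
positive $M$, and an unaffected zone gives $M = 0$.

```python
def modulation_index(r_m, r_0):
    """M = (r_m - r_0) / (r_m + r_0) for two inhibition-zone radii."""
    if r_m < 0 or r_0 < 0 or r_m + r_0 == 0:
        raise ValueError("radii must be non-negative and not both zero")
    return (r_m - r_0) / (r_m + r_0)

print(modulation_index(5.0, 10.0))    # zone halved towards the modulator
print(modulation_index(10.0, 10.0))   # no modulation
```

The index is bounded between -1 (inhibition fully removed on the modulator
side) and +1, which makes scores comparable across producers with inhibition
zones of different sizes.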
Antibiotic modulation assays. Antibiotic modulation assays were similar to the 
three-species assays except that the antibiotic producing species was replaced with 
pure antibiotic and the assay geometry was modified to enable more high- 
throughput measurements. Modulator inoculation plates were prepared by 
spreading 50 ul of 20% glycerol spore stocks onto 1/2X oatmeal agar 9-cm Petri 
dishes with glass beads, incubated for 1 week at 30 °C in plastic bins, then stored at 
room temperature. 25 ml of 1/2X oatmeal agar was spread evenly over a 14-cm 
Petri dish. On day 1, a small amount of antibiotic stock was pipetted onto the 
centre of the agar, absorbed and stored at 30 °C (cefoxitin 10 il, chloramphenicol 
10 yl, ciprofloxacin 10 ul, doxycyclin 10 ul, nitrofurantoin 20 ul, penicillin 20 ul, 
piperacillin 15 pl, rifampicin 20 ul, tobramycin 40 ul, trimethoprim 20 pl). On day 
2, a thin radial line for each of three modulator species was inoculated using the 
edges of sterilized 60-mm cover glass (VWR 48393-070) onto the 14-cm Petri dish. 
Plates were incubated in plastic bins at 30 °C for 3 days and small areas of 
contamination were cut out of the agar. Overlay media was prepared in 25-ml 
aliquots, as in the three-species assays. Overnight cultures of yellow fluorescent 
protein (YFP) and cyan fluorescent protein (CFP) producing E. coli were combined 
at equal ratios (by OD) and mixed 9:1 with 1,000× thiamine stock. 1 ml of 
the E. coli mixture was added to the overlay media, vortexed, and poured evenly 
over the surface of the oatmeal agar. All plates were incubated at 37 °C for 16-21 h 
and photographed in bright field, YFP and CFP channels to record the 
antibiotic inhibition zone. Test strains included the two strongest attenuators from 
the three-species assay, Streptomyces coelicolor and a collection of soil isolates from 
Massachusetts and Colorado soils. Combinations were marked with a dot for 
significance in Extended Data Fig. 1c where the average modulation index M 
deviated from 0 by at least twice the standard error in the mean (P < 0.05, two- 
sided t-test, n = 3 technical replicas). These images were also visually inspected to 
confirm warping of the inhibition-zone boundary. Extended Data Fig. 2b shows 
examples and scoring for different antibiotics. 
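
The marking rule for Extended Data Fig. 1c (mean M at least twice the standard error away from zero) can be sketched as follows. The function name and the `threshold` parameter are illustrative assumptions; the sketch implements only the "twice the standard error" criterion stated above and does not compute a P value.

```python
import math
import statistics


def deviates_from_zero(replicates, threshold=2.0):
    """Return True if |mean| >= threshold * (standard error of the mean).

    replicates: modulation-index values M from technical replicas
    (n = 3 in the assays described above).
    """
    n = len(replicates)
    if n < 2:
        raise ValueError("need at least two replicates to estimate the SEM")
    mean = statistics.fmean(replicates)
    # Sample standard deviation (n - 1 denominator) divided by sqrt(n).
    sem = statistics.stdev(replicates) / math.sqrt(n)
    if sem == 0:
        # Identical replicates: any non-zero mean counts as a deviation.
        return mean != 0
    return abs(mean) >= threshold * sem
```

For example, three replicas near M = -0.3 with little spread are marked, whereas replicas scattered around zero are not.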
β-Lactamase inhibitor assays. 1 ml of 1 mM β-lactamase inhibitor was spread 
evenly over 14-cm 1/2X oatmeal plates with glass beads and allowed to dry in a 
ventilation hood (to minimize contamination). Overlay with YFP and CFP E. coli 
was the same as the antibiotic modulation assay except that 1 ml of 1 mM 
β-lactamase inhibitor stock was added to the overlay media and vortexed before 
adding E. coli. Extended Data Fig. 1d shows data for a collection of 14 species, four 
antibiotics and three β-lactamase inhibitors. Extended Data Fig. 2c shows example 
assay images. 

Isolation of soil bacteria. Isolation and identification of five Streptomyces from 
soil A (Massachusetts) was as described in ref. 17. We chose five strains from a 
collection of 47 isolates that did not produce strong antibiotic inhibition zones on 
oatmeal agar. Soil B (Colorado) was collected from White River National Forest in 
unincorporated Pitkin County. We chose ten strains from a collection of 55 iso- 
lates that did not produce strong inhibition zones, including two Streptomyces 
species, five Actinobacteria species and three non-Actinobacteria species. Species 
identification was through Sanger sequencing of 16S ribosomal DNA. 

Imaging and data analysis. The three-species assay images were photographed 
using a custom-built automated fluorescent imaging system as described in ref. 40. 
Other assays were imaged using a similar but newer system. Levels of antibiotic 
modulation were scored by manually recording the edges of the inhibition zones of the 
fluorescent E. coli using MATLAB. Example images were normalized to fill the 
dynamic range and gamma adjusted to improve contrast. 

Growth media. 1X oatmeal agar: 72.5 g l⁻¹ Difco oatmeal agar (BD). 1/2X 
oatmeal agar: 31.25 g l⁻¹ Difco oatmeal agar (BD) and 6.25 g l⁻¹ Bacto agar 
(BD), with ~100 µl l⁻¹ 10 M NaOH to adjust to pH 7. Media for overlays consisted 
of M63 salts (2 g l⁻¹ (NH₄)₂SO₄, 13.6 g l⁻¹ KH₂PO₄, 0.5 mg l⁻¹ FeSO₄·7H₂O) 
supplemented with 0.4% glucose, 0.02% casamino acids, 1 mM MgSO₄, 1.5 mM 
thiamine, 7.5 g l⁻¹ bacto-agar (BD), adjusting pH to 7.0 with 1 M NaOH. 1,000× 
thiamine was prepared at 1.5 M in H₂O and filter sterilized. 

Chemical stocks. Cefoxitin (Sigma C-4786) 50 mg ml⁻¹ in H₂O, chloramphenicol 
(Sigma C0378) 30 mg ml⁻¹ in EtOH, ciprofloxacin (Sigma 17850) 10 mg ml⁻¹ in 
H₂O + 3 µl ml⁻¹ 10 M HCl, doxycycline (Sigma D-9891) 50 mg ml⁻¹ in H₂O, 
nitrofurantoin (Sigma N7878) 10 mg ml⁻¹ in DMF, penicillin (Sigma 13750) 50 
mg ml⁻¹ in H₂O, piperacillin (Sigma P8396) 50 mg ml⁻¹ in H₂O, rifampicin 
(Sigma R3501) 16 mg ml⁻¹ in DMSO, tobramycin (Sigma T4014) 50 mg ml⁻¹ 
in H₂O, and trimethoprim (Sigma T7883) 5 mg ml⁻¹ in H₂O. β-Lactamase 
inhibitors were prepared at 1 mM in H₂O: clavulanic acid (as potassium clavulanate, 
Sigma 33454), tazobactam sodium salt (Sigma T2820), sulbactam (Sigma 
59701). 
Extended Data Figure 1 | Antibiotic attenuation is widespread among 
natural antibiotic producing species isolated from soil. a, Diagram of three- 
species assays: measuring the antibiotic inhibition zone of E. coli around a 
producer strain enables quantification of three-species interactions caused by a 
modulator strain. b, Attenuating interactions dominate among a set of 54 
Streptomyces producer-modulator combinations. Triangle and square markers 
show scoring from example images in Fig. 1b, left and right respectively. 

c, Antibiotic modulation assays: attenuating interactions dominate among 
combinations of soil species with a panel of ten pure antibiotics. Combinations 
are coloured by modulation index as in b and marked with a dot where the 
modulation index is significantly different from zero (n = 3, Methods). Strains 1 
and 2 are the strongest modulators from the three-species assays, strain 3 is 
Streptomyces coelicolor, strains 4-18 are additional soil isolates (Extended Data 
Table 1). DOX, doxycycline; CMP, chloramphenicol; TOB, tobramycin; CPR, 
ciprofloxacin; FOX, cefoxitin; PIP, piperacillin; PEN, penicillin; NIT, 
nitrofurantoin; RIF, rifampicin; TMP, trimethoprim. d, Scatter plots of average 
modulation index M for 12 species from panel c, with and without β-lactamase 
inhibitors. Points occur off the diagonal for the β-lactams piperacillin and 
penicillin but near the diagonal for the structurally unrelated antibiotics 
rifampicin and nitrofurantoin, consistent with attenuation of the β-lactams 
through a mechanism of antibiotic degradation. Error bars are standard error of 
the mean for technical replicas with inhibitor (n = 3) or without inhibitor 
(control, n = 6). Control is addition of H₂O instead of inhibitor. Clav, clavulanic 
acid; tazB, tazobactam; sulB, sulbactam. 
[Figure image labels: plate grid for producers p1-p4 against modulators m1-m11; 14-cm antibiotic plates labelled nitrofurantoin, rifampicin, ciprofloxacin, doxycycline, trimethoprim, tobramycin and penicillin.] 
Extended Data Figure 2 | Example images from three-species interaction 
assays and antibiotic assays. a, Images and scoring for three-species 
interaction assays from Extended Data Fig. 1b. p indicates producer species, m 
indicates potential modulator species. b, Example images and scoring from the 
antibiotic modulation assays from Extended Data Fig. 1c. Each plate shows tests 
for modulation of antibiotic inhibition for three different species against a 
different antibiotic. Coloured lines show the size of the inhibition zone at the 
location of the radially positioned modulator species; the white circles show the 
radius of the zone of inhibition as inferred from the left side of the plate that 
contains no modulators. c, Example images from an antibiotic modulation 
assay with β-lactamase inhibitors and the β-lactam antibiotic cefoxitin. The left and 
right sides of each plate are inoculated with a line of species 4 from Extended 
Data Fig. 1c. Attenuation is significantly reduced by the β-lactamase inhibitors 
(especially clavulanic acid and tazobactam) when compared to controls (H₂O). 
Extended Data Figure 3 | Illustration of the spatial inhibition-zone model. 
The simulation is performed on a grid of size L × L. A single individual 
occupies each grid location. During each generation: (1) individuals from 
species of type P produce antibiotics within a circle of radius r_production; (2) 
individuals of type R_D remove antibiotics within a circle of radius r_degradation. 
This process is repeated for each of the different antibiotics; (3) all sensitive 
individuals (type S) are killed at any locations that still contain the 
corresponding antibiotics (crossed out S; antibiotic values at each location are 
calculated at centre positions); (4) empty locations of a new grid are filled by 
randomly choosing from any surviving individuals within a radius r_dispersal. If 
there are no surviving individuals within r_dispersal then an empty location 
remains empty. 
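
The four steps in this caption can be sketched as one generation of a lattice simulation. This is an illustrative sketch under stated assumptions, not the authors' implementation: periodic boundaries, Euclidean discs on integer lattice offsets, the phenotype codes 'P', 'D', 'S' and 'R', and all function names are hypothetical choices made here.

```python
import numpy as np

EMPTY = -1  # marker for a vacant grid location (assumption of this sketch)


def disk_offsets(radius):
    """Integer (dy, dx) offsets whose distance from the origin is <= radius."""
    r = int(np.floor(radius))
    return [(dy, dx)
            for dy in range(-r, r + 1)
            for dx in range(-r, r + 1)
            if dy * dy + dx * dx <= radius * radius]


def spread(mask, radius):
    """Mark every cell within `radius` of a True cell (periodic boundaries)."""
    out = np.zeros(mask.shape, dtype=bool)
    for dy, dx in disk_offsets(radius):
        out |= np.roll(mask, shift=(dy, dx), axis=(0, 1))
    return out


def cells_with(grid, phenotype, antibiotic, kind):
    """Boolean map of cells whose species has phenotype `kind` for `antibiotic`."""
    members = [s for s, row in enumerate(phenotype) if row[antibiotic] == kind]
    return np.isin(grid, members)


def generation(grid, phenotype, r_production, r_degradation, r_dispersal, rng):
    """Advance the grid by one generation.

    grid: (L, L) integer array of species indices, EMPTY where vacant.
    phenotype[s][a]: 'P' producer, 'D' degrader (type R_D), 'S' sensitive,
        or 'R' resistant without degrading, for species s and antibiotic a.
    """
    killed = np.zeros(grid.shape, dtype=bool)
    for a in range(len(phenotype[0])):
        # Step 1: producers lay down antibiotic within r_production.
        present = spread(cells_with(grid, phenotype, a, 'P'), r_production)
        # Step 2: degraders remove antibiotic within r_degradation.
        present &= ~spread(cells_with(grid, phenotype, a, 'D'), r_degradation)
        # Step 3: sensitive individuals die where antibiotic remains.
        killed |= present & cells_with(grid, phenotype, a, 'S')
    survivors = np.where(killed, EMPTY, grid)

    # Step 4: fill each location of a new grid from survivors within r_dispersal.
    rows, cols = grid.shape
    offsets = disk_offsets(r_dispersal)
    new_grid = np.full(grid.shape, EMPTY, dtype=grid.dtype)
    for y in range(rows):
        for x in range(cols):
            nearby = [survivors[(y + dy) % rows, (x + dx) % cols]
                      for dy, dx in offsets]
            nearby = [s for s in nearby if s != EMPTY]
            if nearby:
                new_grid[y, x] = rng.choice(nearby)
    return new_grid
```

Calling `generation` repeatedly with `rng = np.random.default_rng(seed)` gives a trajectory; the dispersal loop is written for clarity rather than speed and is only practical for small L.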
Extended Data Figure 4 | Dependence of coexistence on degradation in the 
mixed inhibition-zone model. a, Stability analysis of the full parameter space 
in the mixed inhibition-zone model. Using simulations, we tested the stability 
of cyclic three-species, three-antibiotic communities with dense sampling of all 
possible parameters for the inhibition-zone model, varying strengths of K_P and 
K_D, initial abundances of species 1-3 and growth ratios g_2/g_1 and g_3/g_1. As in 
Fig. 2b, each grid shows a 100-fold range of growth rate ratios from 0.1 to 10. 
Large basins of attraction exist across a wide range of parameter values, with 
maximal stability at high levels of antibiotic production and intermediate levels 
of degradation. b-d, Intuition for why coexistence depends on degradation. 
b, The inhibition-zone model calculates the probability of a given sensitive 
species being inhibited by an antibiotic producer (blue), or being protected by a 
degrading species (red), given the relative strengths of production (K_P) and 
degradation (K_D). The expected area covered by the overlapping circles (left) is 
used to calculate the corresponding inhibition and attenuation probabilities 
(percentage area of filled boxes, right; Methods). c, Focusing on one antibiotic 
in a stable three-species community, increasing the abundance of the yellow 
species creates negative feedback by decreasing the abundance of blue and red, 
which results in more inhibition of yellow. d, Communities are not stable at low 
levels of degradation due to positive feedback, whereby increasing the 
abundance of the yellow species results in a decrease of inhibition. 


[Extended Data Figure 5 image. a, Fixed-point stability and limit cycles plotted against inhibition strength (K_P) and degradation strength (K_D). b, Panels for exponential inhibition, G_1 = R(x) g_1 e^(−K_P C_3); Monod inhibition, G_1 = R(x) g_1/(1 + K_P C_3); linear inhibition; partial inhibition; and diffuse interactions, each plotted against inhibition strength (K_P) and degradation strength (K_D).] 


Extended Data Figure 5 | Coexistence of antibiotic degrading communities 
in a well-mixed chemostat model with three antibiotics. a, Stability of the 
three-species, three-antibiotic community in a single resource chemostat model 
in which species and antibiotics are completely homogeneous. Communities in 
the chemostat model may coexist through fixed points or limit cycles. For 
parameter sets in which the fixed point of the chemostat model was unstable, 
we started simulations close to the fixed point to determine if the community 
coexists through a limit cycle. Limit cycles occur in the areas between the 
dashed and solid black lines. Chemostat parameters: g_1 = 3, g_2 = 6, g_3 = 9, k = 
0.5, P = 1. b, Communities coexist in the chemostat for a wide range of 
assumptions regarding antibiotic mechanisms. We simulated the chemostat 
model while changing how the action of the antibiotics is modelled and 
observed robust coexistence across all models. For each simulation we started 
all species at equal concentration (X_i = 0.2), ran the simulation until t = 100 
generations and calculated the Shannon diversity of the final species levels. The 
default chemostat model assumes exponential inhibition of species by 
antibiotics, but similar coexistence is observed for Monod-like inhibition, for 
linear inhibition (for G_i < 0 we set G_i = 0), when species are only partially 
inhibited or when each species is sensitive at some level to all antibiotics. G_1 is 
the growth rate of species 1 under inhibition, while g_1 is its maximal rate of 
growth; R(x) captures resource dependence on current species levels; C_1, C_2 and 
C_3 are the concentrations of antibiotics produced by species 1-3 respectively 
(Methods). Equations for the growth of species 2 and 3 have the same form as 
G_1. All other parameters are the same as for panel a. 
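
The diversity score used in these panels can be sketched as below. The caption does not state the logarithm base, so the natural logarithm is an assumption here, as is the function name; abundances are normalized to fractions before the sum.

```python
import math


def shannon_diversity(abundances):
    """Shannon diversity H = -sum(p_i * ln p_i) over species fractions p_i.

    abundances: final species levels (any non-negative scale); species with
    zero abundance contribute nothing to the sum.
    """
    total = sum(abundances)
    if total <= 0:
        raise ValueError("total abundance must be positive")
    h = 0.0
    for x in abundances:
        if x < 0:
            raise ValueError("abundances must be non-negative")
        if x > 0:
            p = x / total
            h -= p * math.log(p)
    return h
```

With three species, H ranges from 0 (a single survivor) to ln 3 (equal abundances), so values near ln 3 indicate that all three species persist at comparable levels.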
Extended Data Figure 6 | Coexistence of communities with three species 
and two antibiotics. a, Comparing community diversity in the mixed 
inhibition-zone model and the chemostat model. For each simulation we 
started all species at equal concentration (X_i = 1/3 for the inhibition-zone 
model, X_i = 0.2 for the chemostat model), ran the simulation until t = 100 
generations and calculated the Shannon diversity of the final species levels. 
Other parameters are the same as in Fig. 2 for the inhibition-zone model or 
Extended Data Fig. 5 for the chemostat model. b, Three species communities 
require two antibiotics for stability. When only one antibiotic is degraded the 
community either lacks stability (first panel), or is stable only for a small 
number of growth rates and initial conditions (second panel). When two 
antibiotics are degraded the community is robustly stable to differences in 
species growth rates and initial conditions (third panel), provided that the 
antibiotics inhibit the faster-growing species (species 2 and 3). Basin colours 
as in Fig. 2b; grey shows parameters for which no initial conditions were stable; 
K_P = 40, K_D = 4. 
Extended Data Figure 7 | Robustness of three species communities to 
invasion by cheaters in the mixed inhibition-zone model. Analysis of 
production cheaters (P→R_I, left) and degradation cheaters (R_D→S, right). As 
in Fig. 3, we plot the final abundance of each cheater as a function of its 
growth advantage over its parent species. a, Cheaters cannot invade the three- 
species, three-antibiotic network when their growth advantage is small, except 
for the production cheater of the species with the fastest inherent growth 
rate (species 3, green line), which replaces its parent generating a new stable 
community of three species interacting through two antibiotics. b, This 
resulting three-species, two-antibiotic community is resilient to invasion by 
all cheaters; cheaters must have a substantial growth advantage to invade 
and take over. Parameters for both networks are the same as Fig. 3. Shaded 
areas indicate the maximum and minimum abundance when the 
community reaches stable oscillations. Note, the analysis above is for networks 
with g_1 < g_2 < g_3. The alternative network, with g_1 > g_2 > g_3, is less robust to 
invasion by production cheaters. Two cheaters can invade this community even 
with a small growth advantage: the production cheater for species 2 invades the network in a and 
gives rise to a stable three-species two-antibiotic community, while the 
production cheater for species 1 can take over the communities shown in a and 
b. For degradation cheaters this alternative network is similarly robust to 
cheaters as shown. 
Extended Data Figure 8 | Complex network topologies support coexistence 
of larger numbers of species in the mixed inhibition-zone model. For given ... 
use all of the initial antibiotics, Methods). Colours show the frequency of each 
outcome within all simulated networks, with grey where no stable networks 
Extended Data Figure 9 | A community with chaotic dynamics. Plotting the 
abundance of species 1-3 for the network from Fig. 4b. We show the last 30,000 
steps of a simulation with 40,000 total generations, coloured with a slowly 
changing gradient. The trajectories form a strange attractor. 
Extended Data Table 1 | Strain information for experimental assays 

ED Fig. 1b, ED Fig. 2 | ED Fig. 1c | Species | Notes | Assays 
p1 | - | Kutzneria sp. 744 | producer | 1 
p2 | - | Streptomyces clavuligerus (ATCC 27064) | producer | 1 
p3 | - | Streptomyces sviceus (ATCC 29083) | producer | 1 
p4 | - | Streptomyces sp. Mg1 | producer | 1 
p5 | - | Streptomyces albus (J1074) | producer | 1 
m1 | - | Streptomyces SPB74 | - | 1 
m2 | - | Streptomyces griseoflavus Tu4000 | - | 1 
m3 | - | Streptomyces sp. AA#4 | - | 1 
m4 | 1 | Streptomyces pristinispiralis (ATCC 25486) | - | 1, 2, 3 
m5 | - | Streptomyces hygroscopicus (ATCC 53653) | - | 1 
m6 | - | Streptomyces SPB78 | - | 1 
m7 | - | Streptomyces ghanaensis (ATCC 14672) | - | 1 
m8 | 2 | Streptomyces roseosporus (NRRL 11379) | - | 1, 2, 3 
m9 | - | Streptomyces viridochromogenes (DSM 40736) | - | 1 
m10 | - | Streptomyces lividans (TK24) | - | 1 
m11 | - | Streptomyces sp. E14 | - | 1 
- | 3 | Streptomyces coelicolor | - | 2, 3 
- | 4 | Streptomyces sp. G1-4, JN020492 | soil A | 2, 3 
- | 5 | Kitasatospora sp. G4-12, JN020549 | soil A | 2, 3 
- | 6 | Streptomyces sp. G2-6, JN020526 | soil A | 2, 3 
- | 7 | Unclassified Streptomycetaceae | soil B | 2, 3 
- | 8 | Streptomyces sp. G4-8, JN020545 | soil A | 2, 3 
- | 9 | Streptomyces sp. G1-14 | soil A | 2 
- | 10 | Streptomyces sp. RI18-DHHV3 | soil B | 2 
- | 11 | Amycolatopsis sp. | soil B, Actinobacteria | 2, 3 
- | 12 | Kribbella sp. NBRC 104208 | soil B, Actinobacteria | 2, 3 
- | 13 | Rhodococcus sp. | soil B, Actinobacteria | 2, 3 
- | 14 | Microbacterium sp. | soil B, Actinobacteria | 2 
- | 15 | Rhodococcus sp. 602 | soil B, Actinobacteria | 2 
- | 16 | Agromonas sp. | soil B, Alphaproteobacteria | 2, 3 
- | 17 | Lysobacter sp. | soil B, Gammaproteobacteria | 2 
- | 18 | Bradyrhizobiaceae Bosea | soil B, Alphaproteobacteria | 2 
S | - | E. coli MC4100-YFP/pCS-λ | assay strain w/ chromosomal YFP | 1 
S | - | E. coli MC4100, pZS2R-YFP | assay strain w/ YFP plasmid | 2, 3 
S | - | E. coli MC4100, pZS2R-CFP | assay strain w/ CFP plasmid | 2, 3 

Assay numbers: 1, three-species interaction assay; 2, antibiotic modulation assay; 3, β-lactamase inhibitor assay. 
Global genetic analysis in mice unveils central role 
for cilia in congenital heart disease 


You Li, Nikolai T. Klena, George C. Gabriel, Xiaoqin Liu, Andrew J. Kim, Kristi Lemke, Yu Chen, Bishwanath Chatterjee, 
William Devine, Rama Rao Damerla, Chienfu Chang, Hisato Yagi, Jovenal T. San Agustin, Mohamed Thahir, 
Shane Anderton, Caroline Lawhead, Anita Vescovi, Herbert Pratt, Judy Morgan, Leslie Haynes, Cynthia L. Smith, 
Janan T. Eppig, Laura Reinholdt, Richard Francis, Linda Leatherbury, Madhavi K. Ganapathiraju, Kimimasa Tobita, 
Gregory J. Pazour & Cecilia W. Lo 


Congenital heart disease (CHD) is the most prevalent birth defect, 
affecting nearly 1% of live births; the incidence of CHD is up to tenfold 
higher in human fetuses. A genetic contribution is strongly suggested 
by the association of CHD with chromosome abnormalities 
and high recurrence risk. Here we report findings from a recessive 
forward genetic screen in fetal mice, showing that cilia and cilia- 
transduced cell signalling have important roles in the pathogenesis 
of CHD. The cilium is an evolutionarily conserved organelle project- 
ing from the cell surface with essential roles in diverse cellular processes. 
Using echocardiography, we ultrasound scanned 87,355 chemically 
mutagenized C57BL/6J fetal mice and recovered 218 CHD mouse 
models. Whole-exome sequencing identified 91 recessive CHD mu- 
tations in 61 genes. This included 34 cilia-related genes, 16 genes 
involved in cilia-transduced cell signalling, and 10 genes regulating 
vesicular trafficking, a pathway important for ciliogenesis and cell 
signalling. Surprisingly, many CHD genes encoded interacting pro- 
teins, suggesting that an interactome protein network may provide 
a larger genomic context for CHD pathogenesis. These findings pro- 
vide novel insights into the potential Mendelian genetic contribution 
to CHD in the fetal population, a segment of the human population 
not well studied. We note that the pathways identified show overlap 
with CHD candidate genes recovered in CHD patients, suggesting 
that they may have relevance to the more complex genetics of CHD 
overall. These CHD mouse models and >8,000 incidental mutations 
have been sperm archived, creating a rich public resource for human 
disease modelling. 

Inbred C57BL/6J mice mutagenized with ethylnitrosourea (ENU) were 
bred to recover recessive coding mutations (Extended Data Fig. 1). 
Phenotyping was conducted using non-invasive fetal echocardiography, 
an ultrasound imaging modality also employed clinically for CHD diag- 
nosis. This unbiased study design allows detection of even those CHD 
mutants inviable to term, a segment of CHD cases that would be largely 
clinically inaccessible for genetic analysis. Using a combination of col- 
our flow, two-dimensional, spectral Doppler and M-mode imaging, a 
wide spectrum of CHD phenotypes were recovered, including cardiac 
looping defects, cardiac outflow tract malalignment/septation defects, 
ventricular/atrioventricular septal defects, and various right/left heart 
obstructive lesions (Fig. 1 and Supplementary Videos 1, 2). The CHD 
diagnoses were confirmed with micro-computed tomography (CT)/ 
micro-magnetic resonance imaging (MRI), necropsy and histopathology 
(Extended Data Fig. 1). Heritability was ascertained with the recovery 
of two or more G3 fetuses with the same CHD phenotype (n = 4.6 
± 2.6). Ultrasound scanning of 87,355 G3 fetuses from 2,651 mutant 
lines (defined by the G1 sire) recovered 218 CHD mutant lines. These 
were curated in the Mouse Genome Informatics Database 
(http://www.informatics.jax.org/searchtool/Search.do?query=b2b&submit=Quick%0D%0ASearch), 
with G1 sperm cryopreserved in the Jackson 
Laboratory (JAXMice). 

Many mutant lines (30%) exhibited complex CHD associated with 
heterotaxy, the randomization of left-right patterning. These heterotaxy 
mutants often died prenatally. We note that clinical studies show 
that 16% of human fetuses with complex CHD have heterotaxy, and 
that these also exhibit high prenatal/intrauterine death (30-60%). Also 
commonly observed in mutant lines with or without laterality defects 
are double outlet right ventricle (DORV) and atrioventricular septal 
defects (AVSD), phenotypes that usually resulted in prenatal lethality. 
We note that DORV and AVSD are also observed with high prevalence 
in aborted/stillborn human fetuses. Most heterotaxy mutant lines 
exhibited three distinct visceral organ situs phenotypes: normal situs 
solitus, mirror symmetric situs inversus, or randomization with heterotaxy, 
with complex CHD observed only with heterotaxy (Extended Data 
Fig. 2). These three phenotypes are found in patients and mice with 
mutations causing primary ciliary dyskinesia (PCD), a sinopulmonary 
disease arising from immotile/dyskinetic respiratory cilia. This 
reflects the common requirement for motile cilia in airway clearance 
and left-right patterning. Hence, we conducted videomicroscopy 
of the tracheal epithelia to assess cilia motility as a proxy for the node. 

Mutation recovery was conducted with whole-exome sequencing 
analysis of a single mutant from each of 113 mutant lines. Cumulatively 
this yielded 7,996 coding mutations in 4,809 genes (Extended Data Fig. 3a). 
Genotyping analysis identified the pathogenic mutation as one con- 
sistently homozygous across all G3 mutants bearing the same pheno- 
type. In mutant lines with multiple homozygous mutations on the same 
chromosome linked to the CHD phenotype, the pathogenic mutation 
was identified with segregation analysis of additional mutants, and/or 
based on phenotype comparison to existing knockout/mutant alleles. 
For five lines (b2b002, b2b012, b2b284, b2b1519 and b2b1146), the mutation 
was confirmed with mapping analyses. 

In this manner, 91 pathogenic mutations were recovered in 61 genes 
(Supplementary Data 1). The distribution of missense, splicing and 
nonsense mutations was similar to other ENU mutagenesis studies 
(Extended Data Fig. 3a). All mutations except one were in highly con- 
served amino acids (Supplementary Data 2). Five of the 19 splicing 
mutations are >2 bases from the canonical splice donor-acceptor sites, 
the farthest being 10 bases away (Dnah5“ 13°? -10T>4.49b1537; validated 
by complementary DNA sequencing, Extended Data Fig. 4). Multiple 
alleles were recovered in 15 genes (Extended Data Table 1). Of 61 CHD 
genes recovered, 27 were not previously annotated to cause CHD (Ex- 
tended Data Fig. 3b), either because no mutant/knockout mice were avail- 
able, or because the CHD phenotype was not investigated (Supplementary 
Figure 1 | Ultrasound diagnoses of CHD and cilia defects in CHD mutants. 
a, b, Vevo 2100 colour flow imaging showed criss-crossing of blood flow 
indicating normal aorta (Ao) and pulmonary artery (PA) alignment 

(a; Supplementary Video 1), confirmed by histopathology (b). Cd, caudal; Cr, 
cranial; L, left; LV, left ventricle; R, right; RV, right ventricle. c, d, Embryonic 
day (E)16.5 mutant mouse (line b2b327) exhibited a blood flow pattern 
indicating single great artery (pulmonary artery) and ventricular septal defect 
(VSD) (c), suggesting aortic atresia with ventricular septal defect, confirmed by 
histopathology (d). e-h, Colour flow imaging of E15.5 mutant mouse (line 
b2b2025) with heterotaxy (stomach on right; Supplementary Video 4) showed 
side by side aorta and pulmonary artery, with the aorta emerging from the 
right ventricle, indicating DORV/ventricular septal defect (e, f; Supplementary 


Data 1). In some instances, the existing mutants died early, precluding 
CHD analysis, such as the Smarca4-knockout mouse with peri-implantation 
lethality. A Hectd1 mutant was previously described that 
only had neural tube defects, versus our Hectd1 mutant, which 
was shown to have aortic atresia (Fig. 1c, d). The identical Adamts6 
mutation was found in three lines, and was shown to be a spontaneous 
mutation in the C57BL/6J support colony (Supplementary Data 1). 

Thirty-four of the 61 CHD genes are cilia-related (Fig. 2), 23 in 
laterality-mutant lines (Extended Data Table 2 and Extended Data 
Figs 3a, 5). Twelve are required for motile cilia function, with eight 
clinically known to cause PCD and three that are probably novel PCD 
genes (Ccdc151, Daw1 and Pkd1l1) (Extended Data Tables 2, 3 and Supplementary 
Data 1). A mutation was recovered in Foxj1, a transcription 
factor known to regulate ciliogenesis. While Foxj1-knockout mice have 
no cilia in the airway, our Foxj1 mutant harbouring a missense 
mutation in the DNA-binding domain has ciliated trachea with robust 
ciliary motion (Supplementary Video 5), but with no nodal flow (Sup- 
plementary Video 6). 

Mutations in 22 ciliome genes affect primary cilia—11 in laterality- 
and 11 in non-laterality-mutant lines (Extended Data Table 2 and Ex- 
tended Data Figs 3a, 5). Among non-laterality mutants are Cep110 (also 
Video 2) and the presence of AVSD (g, h; Supplementary Videos 3, 4). LA, left 
atrium; RA, right atrium. i-k, Histopathology also showed a bicuspid aortic 
valve (BAV) (i), interrupted aortic arch (IAA) (j), and common atrioventricular 
(AV) valve (k). l-n, Cc2d2a-mutant mouse exhibits dextrocardia with 
ventricular inversion (dextroversion) (m), and AVSD (l) with malformed 
atrioventricular cushions (n), but normal outflow cushions. Atr, atrium; mLV, 
morphologic left ventricle; m/m, Cc2d2a-mutant mouse; mRV, morphologic 
right ventricle. o-x, Confocal imaging of E12.5 Cc2d2a-mutant mouse (m/m) 
versus wild-type (+/+) embryo sections showed no cilia in the atrioventricular 
cushion (o, p), but normal ciliation in the outflow cushion (OFT cushion) 

(q, r). s-x, Fewer and shorter cilia were observed in other mutant embryo 
tissues. Red, acetylated tubulin (Acet-tub.); green, IFT88. 


known as Cntrl), a centrosome component, Jbts17, a cilia transition 
zone protein, and four other genes (Fuz, Myh10, Dctn5, Lrp2) that also 
have important roles in vesicular/endocytic trafficking, a biological process 
essential for ciliogenesis (Fig. 2). Many genes affecting laterality 
are associated with non-PCD ciliopathies (Anks6, Bicc1, Tbc1d32, 
Cep290, Dync2h1, Ift74, Ift140, Nek8, Pcsk5, Pskh1, Cc2d2a, Tmem67; 
Extended Data Table 3). Analysis of a Cc2d2a-mutant embryo by confocal 
imaging showed shorter and fewer cilia throughout the embryo 
(Fig. 1s-x). In the Cc2d2a-mutant embryo, whereas cilia were abundant 
in the outflow cushions, they were absent in the atrioventricular cushions 
(Fig. 1p, r). Consistent with this, Cc2d2a-mutant embryos die prenatally 
with AVSD (Fig. 1l, n), but with normal semilunar valves (Fig. 1m, n). 

Of the 26 non-ciliome CHD genes, 15 are known to mediate cell sig- 
nalling, including the SHH signalling pathway (Fuz, Kif7, Lrp2, Sufu, 
Tbc1d32), WNT/planar cell polarity signalling (Ptk7, Prickle1, Fuz), 
TGF-β/BMP signalling (Cfc1, Ltbp1, Megf8, Pcsk5, Smad6, Tab1) and 
calcium signalling (Pkd1l1, Pkd1) (Fig. 2). Five of these were recovered 
in the laterality-mutant lines (Tbc1d32, Cfc1, Megf8, Pcsk5, Pkd1l1). 
These pathways are all transduced or modulated by the cilium, and 
have important roles in cardiovascular development and/or left-right 
patterning. 
We also recovered ten CHD genes mediating vesicular/endocytic 
trafficking, including four (Lrp2, Myh10, Dctn5, Fuz) that are ciliome-related 
(Fig. 2). The enrichment for endocytic genes may be mechanistically 
linked to cilia, as vesicular trafficking at the cilia base has 
important roles in both ciliogenesis and cilia-transduced cell signalling. 
Figure 2 | CHD genes recovered from mouse mutagenesis screen. Diagrams 
illustrate biological context of CHD gene function (colour highlighting 
indicates CHD genes recovered; asterisks denote CHD genes recovered from 
previous screen). For clarity, ciliome genes (Dctn5, Fuz) with endocytic 
function and/or involved in cilia-transduced signalling are not shown in the 
ciliogenesis panel. AP, adaptor protein complex; R, receptor; MVB, multi- 
vesicular body; TGN, trans-Golgi network; Ub, ubiquitination. 


These endocytic genes caused a similar spectrum of cardiac outflow 
tract malalignment and septation defects (Extended Data Fig. 6). Only 
Ap1b1, an adaptin subunit mediating endocytic protein sorting in the 
Golgi, was observed to cause laterality defects (b2b1660; Extended Data 
Fig. 2). Whereas Ap1b1 mutants showed normal airway and embryonic 
node cilia motility, Ap1b1-mutant embryonic fibroblasts exhibited a 
defect in ciliogenesis (65% versus 85% ciliation in Ap1b1-mutant ver- 
sus wild-type fibroblasts; P = 5.90 X 10°). Ap2b1, an adaptin protein 
complex 2 subunit homologous to Ap1b1, functions in clathrin-mediated 
endocytosis. Snx17, known to regulate endocytic receptor recycling, 
has a mutation in the cargo-binding FERM domain, a region known to 
interact with the NPXY motif of the endocytic protein LRP1, which was 
also recovered in our screen (Extended Data Fig. 3b). 

Other CHD genes recovered included a novel zinc finger protein, 
Zbtb14, and two genes modulating chromatin, Smarca4 and Prdm1. 
Also recovered were six extracellular-matrix-related genes (Acan, Adamts6, 
Frem2, Lox, Mmp21, Ndst1), one (Lox) identified as cilia-related, three
novel genes (Cml5, Gm572, Mmp21) causing CHD with laterality defects, 
and two genes (Acan, Pkd1) causing biventricular hypertrophy with- 
out other structural heart defects. 

Many of the CHD genes encode proteins that are known to interact
physically with each other or that belong to the same multiprotein
complexes. For example, ANKS6 can bind BICC1 and also NEK8, with NEK8 and
ANKS6 co-localizing in the cilia inversin compartment. TMEM67, CEP290,
JBTS17 and CC2D2A are co-localized in the NPHP1-4-8 multiprotein complex
in the cilia transition zone. DNAH5, DNAH11, DNAI1 and DAW1 are associated
with the outer dynein arm complex of motile cilia, while IFT140 and IFT74
are intraflagellar transport protein components that together with
DYNC2H1 mediate protein transport in the cilium (Fig. 2). SNX17 can
directly bind LRP1 to promote LRP1 cell surface recycling. These findings
prompted an examination of the 113 exomes for the prevalence of additional
mutations in the 61 CHD genes. This revealed 58 additional incidental
mutations in 32 of the CHD genes, a significant enrichment compared to
random simulations (P = 0.00114; Methods), suggesting possible modifier
effects.

To assess further for potential protein-protein interactions (PPIs),
we constructed an interactome of CHD genes using known PPIs in the
public databases and our own computationally predicted PPIs (Methods).
This yielded an interactome of 778 genes, 292 of which are predicted
PPIs (Supplementary Data 3 and Fig. 3a). The average shortest distance
between CHD genes is 4.3 ± 2.5 edges, whereas random genes of matched
degree distribution had a larger distance of 5.7 ± 4.4 edges (P = 0.01).
Statistical analysis of the interactome revealed significant gene
ontology (GO) term enrichment related to development, including heart
development terms (Supplementary Data 3; Fig. 3b). Pathways identified
included 'clathrin-mediated endocytosis signalling', 'TGF-β signalling',
'BMP signalling' and 'axon pathfinding' (Supplementary Data 3). Several
CHD genes recovered are known to regulate axon guidance and neurogenesis,
including Plxnd1, Robo1 and Cxcr4, while Pde2a regulates synaptic
plasticity and long-term potentiation, and Myh10 is required for synaptic
membrane recycling (Extended Data Fig. 7).

Our study provides compelling evidence that the cilium has a critical 
role in CHD pathogenesis. The observed enrichment for cilia and cilia- 
related genes emerged from a phenotype-driven forward genetic screen 
agnostic to the specific genes that may be recovered. The finding of 
ciliome mutations in CHD mutant lines with and without laterality de- 
fects would suggest a broad role for cilia in CHD pathogenesis. While 
our study is focused on monogenic recessive coding mutations, we noted 

Figure 3 | Interactome network of CHD genes with known and predicted
interactors. a, An interactome network was constructed comprising the 61 
CHD genes (magenta squares) and their known (blue edges) and predicted 


a striking overlap between genes recovered in the mouse screen and
heterozygous de novo coding mutations in CHD candidate genes identified
in CHD patients. Of 28 CHD genes identified in the human study, 11 are
cilia-related, 7 are in the TGF-β/SHH/WNT signalling pathway, and 2 are
involved in vesicular/endocytic trafficking (Supplementary Data 4). A
heterozygous mutation in PITX2, a gene essential for left-right
patterning, was recovered in a non-heterotaxy CHD patient, suggesting a
broader role for left-right patterning in CHD pathogenesis. The human
study also identified an important role for genes regulating chromatin.
Consistent with this, our screen recovered two mouse CHD genes modulating
chromatin, Smarca4 and Prdm1 (ref. 5). Hence, findings from our fetal
mouse screen may provide insights not only into the Mendelian recessive
genetic contribution to CHD in the unborn, a segment of the CHD
population not well studied, but may also be relevant to the complex
genetics of CHD overall. The CHD mouse models generated in this study are
invaluable resources for interrogating CHD pathogenesis, with the
cryopreserved mutant lines providing access to a large library of
mutations, including null mutations for genes without KOMP alleles.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


CHD phenotyping. C57BL/6J mice were chemically mutagenized using
ethylnitrosourea and subjected to a two-generation backcross breeding
scheme. To identify CHD mutants, G3 fetuses were ultrasound scanned with a
two-tier ultrasound phenotyping strategy using the Acuson Sequoia and
Vevo2100 ultrasound systems as previously described. The fetal ultrasound
imaging was conducted at E13.5-15.5, after completion of major
cardiovascular developmental processes, including outflow tract septation
and ventricular chamber septation. All of the CHD diagnoses were confirmed
or refined by additional micro-CT/micro-MRI analyses and
three-dimensional histopathology using episcopic confocal microscopy
(Extended Data Fig. 1). The summative CHD diagnosis and phenotype
description are curated in the public Mouse Genome Informatics (MGI)
database (http://www.informatics.jax.org; enter "b2b" into the search box
to pull up all the lines or type in specific line IDs or MGI IDs to go
directly to a specific mutant line). Included in each mutant line MGI
record are sample images and videos collected from various imaging
modalities, including ultrasound images and videos, necropsy images,
micro-CT/micro-MRI scans, episcopic confocal microscopy image stacks and
three-dimensional reconstructions, histology, and high-speed videos for
analysis of tracheal airway cilia motility. All animal experiments
described in this study were conducted with approved Institutional Animal
Care and Use Committee (IACUC) protocols from the University of
Pittsburgh.

Cilia videomicroscopy. Tracheae of newborn mice were harvested and mounted
as previously described. For nodal cilia imaging, the node-containing
region of E7.5-E7.75 embryos (3-6 somite stages) was dissected and mounted
on a glass coverslip. Videomicroscopy was carried out using a Leica
DMIRE2 inverted microscope at 37 °C with a ×100 oil objective and DIC
optics. To examine flow, 0.35 µm
microspheres were added to the medium bathing the ciliated tissue samples (~1 
10’! particles per ml; Polysciences). High-speed videos (200 fps) were collected 
using a Phantom v.4.2 camera (Vision Research). 

Cilia immunostaining analysis. E12.5 mutant and control littermate embryos
from b2b1035 were immunostained with ciliary protein antibodies including
acetylated α-tubulin antibody (Sigma T7451, 1:1,000) and IFT88 (1:500), as
previously described.

Mouse embryonic fibroblasts derived from E11.5-E12.5 b2b1660 mutant and
control littermate embryos were serum starved for 24 to 48 h to grow out
the cilia, with staining of ciliary proteins carried out using primary
antibodies against acetylated α-tubulin (Sigma T7451, 1:1,000) and
γ-tubulin (Sigma T6557, 1:1,000). All immunostaining samples were imaged
by confocal microscopy.

Mutation recovery with whole-exome sequencing analysis. Genomic DNA from
CHD mutant mice was sequence captured using the Agilent SureSelect Mouse
All Exon kit V1, and sequenced using Illumina HiSeq 2000 with minimum
average 50× target sequence coverage (BGI Americas). Sequence reads were
aligned to the C57BL/6J mouse reference genome (mm9) and analysed using
CLCBio Genomic Workbench and GATK software. To minimize false negatives,
variant calls were set at 5× minimum coverage and ≥20% alternate reads.
Sequence variants were annotated with ANNOVAR
(http://www.openbioinformatics.org/annovar/) and filtered against
dbSNP128 and our in-house mouse exome databases with custom scripts.
Intronic variants within 10 bp from the splice junction were annotated as
potential splicing mutations. All of the mutations recovered are
searchable together with phenotype information via the public Bench to
Bassinet Congenital Heart Disease Mouse Mutation Database
(http://benchtobassinet.com/ForResearchers/BasicScienceDataResourceSharing/GeneDiscoveryinMouseModels.aspx).
The exome data sets are available from the GNomEx CvDC Datahub
(https://b2b.hci.utah.edu/gnomex/gnomexGuestFlex.jsp?topicNumber=67).
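
The calling thresholds described above can be illustrated with a minimal sketch. It is not the authors' pipeline (which used CLCBio Genomic Workbench, GATK, ANNOVAR and custom scripts); the `Variant` record and both function names are hypothetical, and only the three numeric thresholds come from the text.

```python
from dataclasses import dataclass

MIN_COVERAGE = 5          # 5x minimum coverage (from the Methods)
MIN_ALT_FRACTION = 0.20   # >=20% alternate reads (from the Methods)
SPLICE_WINDOW_BP = 10     # intronic distance to the splice junction (from the Methods)


@dataclass
class Variant:
    """Hypothetical record layout, used here for illustration only."""
    gene: str
    coverage: int       # total reads covering the site
    alt_reads: int      # reads supporting the alternate allele
    intronic: bool
    bp_from_exon: int   # distance to the nearest exon boundary; 0 if exonic


def passes_call_threshold(v: Variant) -> bool:
    """Keep a call only if coverage and alternate-read fraction are high enough."""
    if v.coverage < MIN_COVERAGE:
        return False
    return v.alt_reads / v.coverage >= MIN_ALT_FRACTION


def is_potential_splicing(v: Variant) -> bool:
    """Flag intronic variants lying within 10 bp of the splice junction."""
    return v.intronic and 1 <= v.bp_from_exon <= SPLICE_WINDOW_BP
```

Permissive thresholds of this kind favour sensitivity, which matches the stated aim of minimizing false negatives; filtering against known-variant databases would then be a separate step.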

Ciliome gene annotation. Ciliome genes were derived from a compilation of
multiple bioinformatics, genomic and proteomic studies in different
species. This included a bioinformatics study of X-box-containing genes in
Caenorhabditis elegans and several proteomic and comparative genomic
studies comprising ciliated organisms in several species (C. elegans,
Drosophila, human, Chlamydomonas, Plasmodium falciparum and Trypanosoma).
In addition, we also included genes recently identified in the cilia
transition zone complex and genes identified to cause primary ciliary
dyskinesia
(http://ghr.nlm.nih.gov/condition/primary-ciliary-dyskinesia/show/Related+Gene(s)).

Analysis of splicing defect mutations. For analysis of splicing defect mutations, 
RNA was isolated from skin, trachea or heart tissue and reverse transcribed to gen- 
erate cDNA using a High Capacity RNA-to-cDNA Kit (Life Technologies). PCR 
amplification of the cDNA was carried out with primers spanning the affected 
splice junction using AmpliTaq Gold DNA polymerase system (Life Technologies) 
(95 °C for 5 min; 40 cycles of 95 °C for 30 s, 58 °C for 30 s and 72 °C for 1 min; 72 °C
for 5 min). The amplified products were separated on a 3% agarose gel and the bands 
were excised using the QIAquick Gel Extraction Kit (Qiagen) and analysed by 
Sanger capillary sequencing. 

Simulation-based mutation enrichment analysis. A simulation approach was 
used to test whether mutations in the 61 CHD genes are enriched in the mouse 
exome data sets generated from the 113 CHD mutant mouse lines. From the 113 
exomes, we recovered 149 mutations in the 61 CHD genes, 91 of which were iden- 
tified as pathogenic, yielding an excess of 58 incidental mutations. For this simu- 
lation, all the genes from the mouse genome were grouped in bins of 500 genes 
based on coding DNA sequence (CDS) length. Then 61 genes with the same size 
distribution as the observed 61 CHD genes were randomly sampled. For each bin, 
we randomly sampled the same number of genes as those observed for the CHD 
genes, and counted the number of mutations in these genes in the 113
mouse exomes. On the basis of 50,000 sets of 61 randomly sampled genes, we
calculated a P value based on the count of random gene sets that
harboured 149 or more mutations. To take potential gene coverage bias into
account, we limited our random gene selection to genes with at least one
coding mutation in our CHD exomes, which yielded a P value of 0.00114.
When we chose random genes from all genes, including genes not found in
our exome data set, the P value was <0.00002.
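
The resampling scheme described above can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' code: the function names and the dictionary inputs (`cds_length`, `mutation_count`) are hypothetical, and the text does not say whether the CHD genes themselves were excluded from the random draws, so here they are left in the sampling pool. The bin size (500 genes), the number of random sets (50,000) and the "as many or more mutations" criterion come from the text.

```python
import random
from collections import Counter


def length_bins(cds_length, bin_size=500):
    """Rank genes by CDS length and assign consecutive bins of `bin_size` genes."""
    ranked = sorted(cds_length, key=cds_length.get)
    return {gene: i // bin_size for i, gene in enumerate(ranked)}


def enrichment_p_value(target_genes, cds_length, mutation_count,
                       n_sets=50_000, bin_size=500, seed=0):
    """Empirical P value that length-matched random gene sets carry at least
    as many mutations as the target set.

    cds_length:      gene -> CDS length in bp (hypothetical input; must cover
                     every target gene)
    mutation_count:  gene -> number of mutations seen across the exomes
    """
    rng = random.Random(seed)
    bin_of = length_bins(cds_length, bin_size)

    # Genes available for sampling in each length bin.
    members = {}
    for gene, b in bin_of.items():
        members.setdefault(b, []).append(gene)

    # How many target genes fall in each bin; random sets copy this profile.
    quota = Counter(bin_of[g] for g in target_genes)
    observed = sum(mutation_count.get(g, 0) for g in target_genes)

    hits = 0
    for _ in range(n_sets):
        total = 0
        for b, k in quota.items():
            for g in rng.sample(members[b], k):
                total += mutation_count.get(g, 0)
        if total >= observed:
            hits += 1
    return observed, hits / n_sets
```

To mirror the coverage-bias correction in the text, `cds_length` would be restricted beforehand to genes with at least one coding mutation in the exome data. In the study itself, the observed count is 149 mutations, of which 91 were pathogenic, leaving 149 − 91 = 58 incidental mutations.
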
Interactome analysis. To construct the interactome, each of the mouse CHD
genes was mapped to its human orthologue, and PPIs of these genes were
extracted from the Human Protein Reference Database (HPRD;
http://www.hprd.org) and BioGRID (http://thebiogrid.org). The interactome
was augmented with PPIs discovered computationally using the recently
developed HiPPIP model (unpublished results). Pathway and GO term
enrichment analyses of the interactome were carried out using the
Ingenuity Pathway Analysis suite and the BiNGO plugin of Cytoscape
(http://www.cytoscape.org/). The P value for each pathway and GO term was
calculated using Fisher's exact test with Benjamini-Hochberg correction
(Supplementary Data 3).
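
As a worked illustration of the statistics named above, the sketch below computes an over-representation P value from the hypergeometric distribution and then applies the Benjamini-Hochberg adjustment. It is a stand-in written with the Python standard library, not the Ingenuity or BiNGO implementation; the one-sided form of Fisher's exact test and both function names are assumptions.

```python
from math import comb


def fisher_enrichment_p(k, n, K, N):
    """One-sided Fisher's exact test for over-representation.

    N: genes in the background, K: background genes annotated with the term,
    n: genes in the interactome, k: interactome genes annotated with the term.
    Returns P(X >= k) for X ~ Hypergeometric(N, K, n).
    """
    upper = min(n, K)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / comb(N, n)


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Each term's raw P value would be computed with `fisher_enrichment_p`, and the full list passed once to `benjamini_hochberg` so that the correction accounts for every pathway or GO term tested.
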

Shortest paths were computed using Dijkstra's algorithm implemented in the
networkx package of Python. The average length of the shortest paths
between pairs of CHD genes was compared to that of 61 random genes, where
random genes were sampled to have a similar degree distribution to the 61
CHD genes. For this, all human genes were categorized into 7 bins based on
their degree in the interactome (0, 1-5, 6-25, 26-50, 51-75, 75-100,
>100). As many random genes were selected from any bin as the number of
CHD genes in that bin. This process was repeated 1,500 times, and the
average length of the shortest paths was computed each time. The P value
was computed empirically to determine statistical significance.
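
The degree-matched comparison can be sketched as below. This is a standard-library illustration rather than the authors' networkx code: on an unweighted network, breadth-first search returns the same path lengths as Dijkstra's algorithm with unit edge weights. The function names and the adjacency-dictionary input are hypothetical, unconnected pairs are simply skipped (the text does not say how they were handled), and because the published bins "51-75" and "75-100" overlap, a degree of exactly 75 is assigned to the 51-75 bin here.

```python
import random
from collections import deque

# Degree bins from the Methods; 75 is placed in the 51-75 bin (assumption).
DEGREE_BINS = [(0, 0), (1, 5), (6, 25), (26, 50), (51, 75), (76, 100),
               (101, float("inf"))]


def degree_bin(degree):
    for idx, (lo, hi) in enumerate(DEGREE_BINS):
        if lo <= degree <= hi:
            return idx
    raise ValueError(f"invalid degree: {degree}")


def bfs_distances(adj, source):
    """Edge-count distances from `source` to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def mean_shortest_path(adj, genes):
    """Mean shortest-path length over connected gene pairs (None if no pair)."""
    genes = list(genes)
    lengths = []
    for i, a in enumerate(genes):
        dist = bfs_distances(adj, a)
        lengths.extend(dist[b] for b in genes[i + 1:] if b in dist)
    return sum(lengths) / len(lengths) if lengths else None


def empirical_p(adj, target_genes, n_iter=1500, seed=0):
    """Fraction of degree-matched random gene sets whose mean shortest path
    is no longer than that of the target set.

    `adj` maps every gene (including isolated ones) to a set of neighbours.
    """
    rng = random.Random(seed)
    pool = {}
    for gene, nbrs in adj.items():
        pool.setdefault(degree_bin(len(nbrs)), []).append(gene)
    quota = {}
    for gene in target_genes:
        b = degree_bin(len(adj[gene]))
        quota[b] = quota.get(b, 0) + 1

    observed = mean_shortest_path(adj, target_genes)
    if observed is None:
        return None, None
    hits = valid = 0
    for _ in range(n_iter):
        sample = [g for b, k in quota.items() for g in rng.sample(pool[b], k)]
        mean_len = mean_shortest_path(adj, sample)
        if mean_len is not None:
            valid += 1
            if mean_len <= observed:
                hits += 1
    return observed, (hits / valid if valid else None)
```

Matching the degree profile matters because highly connected genes sit close to everything in the network; without it, a set enriched for hub genes would look tightly clustered for trivial reasons.
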
Extended Data Figure 1 | Breeding, phenotyping and mutation recovery
pipeline for mouse forward genetic screen. a, Two-generation backcross
breeding scheme used to generate G3 mutants with recessive mutations
causing congenital heart defects, with all offspring from a single G1 male
defined as a distinct pedigree or mutant line. b, Pipeline for recovery,
curation and cryopreservation of CHD mutant mouse models and the recovery
of pathogenic CHD-causing mutations.

Extended Data Figure 2 | Situs anomalies and congenital heart defects in
Ap1b1 (b2b1660) mutants. a-c, Mutants from line 1660, identified with an
Ap1b1 mutation, exhibit situs solitus (a), situs inversus (b) or
heterotaxy (c). In situs solitus, characterized by normal left-right
visceral organ positioning, the heart apex (arrow) points to the left
(levocardia), four lung lobes are on the right and one on the left, the
stomach is to the left, and the dominant liver lobe is on the right. With
situs inversus, there is complete mirror reversal of organ situs, while
with heterotaxy, visceral organ situs is randomized, such as the
dextrocardia with levogastria shown in c. d-g, The heterotaxy mutant in c
exhibits complex CHD with atrioventricular septal defect (AVSD) (d),
ventricular septal defect (VSD) (e), duplicated inferior vena cava (IVC)
(f) and left pulmonary isomerism with bilateral single lung lobes (g). Ao,
aorta; L1-5, lung lobes 1-5; Lv1-3, liver lobes 1-3; mLA, morphologic left
atrium; mLV, morphologic left ventricle; mRA, morphologic right atrium;
mRV, morphologic right ventricle; PA, pulmonary artery; Stm, stomach.

Extended Data Figure 3 | Distribution of pathogenic mutations recovered
from the forward genetic screen. a, Distribution of all incidental coding
mutations (left), pathogenic mutations (middle) and ciliome CHD genes
(right) recovered from 113 CHD mouse mutant lines. b, Recovery of
pathogenic mutations and associated CHD phenotypes. Grey-filled boxes
indicate CHD mutations in genes not previously identified to cause CHD.
Ao, aorta; AVSD, atrioventricular septal defect; BVH, biventricular
hypertrophy; DORV, double outlet right ventricle; IAA, interrupted aortic
arch; MAPCA, major aortopulmonary collateral artery; PA, pulmonary artery;
PTA, persistent truncus arteriosus; RAA, right aortic arch; TGA,
transposition of the great arteries; VSD, ventricular septal defect; VS,
vascular sling.

Panel a labels: All mutations, n = 7,996 (found in 4,809 genes;
heterozygous or homozygous; missense, splicing or nonsense). Pathogenic
mutation type, n = 91. Ciliome-CHD genes, n = 34 (motile cilia genes in
laterality lines; primary cilia genes in laterality lines; primary cilia
genes in non-laterality lines; motile cilia genes not found in
non-laterality lines). Panel b: table of genes with their cardiac and
laterality phenotypes.

Extended Data Figure 4 | Pathogenic splicing mutations causing CHD.
a, The 19 pathogenic splicing mutations recovered are shown, with
mutations located beyond the 2-base canonical splice junction highlighted
in grey. b, Schematic diagram showing the anomalous Dnah5 c.13329-10T>A
mutant transcript observed, with the polymerase chain reaction (PCR)
primer location and anomalous PCR product size indicated. c, Sanger
sequencing profile showing point mutation in Dnah5 c.13329-10T>A mutant
transcript versus that of wild type. d, PCR amplification of Dnah5
c.13329-10T>A heterozygous mutant (m/+) showed the expected 514 bp
wild-type and 271 bp mutant PCR product, while only the 271 bp mutant
product was observed in the homozygous mutant (m/m) sample.

Panel a:

Gene      Chr  Position    Nucleotide change  Bases from exon (bp)  Splicing site affected
Dnai1     4    41544973    c.240+1G>A         1                     Donor
Gm572     4    148017493   c.26+1G>A          1                     Donor
Plxnd1    6    115919325   c.2691+1G>A        1                     Donor
Ccdc151   9    21799255    c.828+2T>C         2                     Donor
Ccdc39    3    33737998    c.357+2T>A         2                     Donor
Cep290    10   99962906    c.1189+2T>C        2                     Donor
Dnah11    12   119425356   c.1137+2T>C        2                     Donor
Dnah11    12   119282224   c.6489+2T>C        2                     Donor
Dnah5     15   28275670    c.7398+2T>A        2                     Donor
Dnah5     15   28159416    c.438+2T>A         2                     Donor
Fuz       7    52152701    c.387+2T>A         2                     Donor
Tbc1d32   10   55915160    c.1071+2T>A        2                     Donor
Bicc1     10   70420407    c.606+2T>C         2                     Donor
Ndst1     18   60850830    c.2426+2T>A        2                     Donor
Lrp2      2    69316757    c.8456-3A>G        3                     Acceptor
Tmem67    4    11974951]   c.2322+6T>C        6                     Donor
Dnm2      9    21282798    c.1196+6T>A        6                     Donor
Dctn5     7    129281879   c.348+6T>C         6                     Donor
Dnah5     15   28388293    c.13329-10T>A      10                    Acceptor

Panels b-d labels: Dnah5 c.13329-10T>A; exons 75, 76, 77, 78 and 79;
Primer A = GCCTGGCTGATGATATGCTA; Primer B = GGAGGAGCAGAAATGTCGTC;
514 bp wild-type product; 271 bp mutant product.

Extended Data Figure 5 | Ciliome mutations causing CHD with and without
laterality defects. A flow chart showing the distribution of ciliome
versus non-ciliome CHD genes among laterality versus non-laterality CHD
lines, and further stratification of ciliome CHD genes affecting primary
versus motile cilia function.

Flow chart labels: 113 mutant mouse lines analysed by exome sequencing;
91 pathogenic CHD-causing mutations recovered; 22 ciliome genes likely
primary cilia related (65%); 12 ciliome genes required for motile cilia
function (35%).

Extended Data Figure 6 | CHD phenotypes associated with mutations
affecting endocytic trafficking. a-c, Outflow tract malalignment defects
with double outlet right ventricle (DORV) and overriding aorta (Ao) were
observed in Ap2b1 (a), Dnm2 (b) and Snx17 (c) mutants, with the Ap2b1 (a)
mutant also showing anterior positioning of the aorta (Taussig-Bing type
DORV). The Snx17 mutant also has AVSD. d, e, Lrp2 (d) and Lrp1 (e) mutants
both exhibited outflow tract septation defect with persistent truncus
arteriosus (PTA). Scale bar, 0.5 mm.

Panel labels (mutant lines): b2b2321, b2b2159, b2b1625a, b2b1625b, b2b1554.

Extended Data Figure 7 | CHD genes associated with axon guidance.
Diagram illustrating the biological context of several CHD genes known to
be involved in axonal guidance (colour highlighting indicates CHD genes
recovered from the present screen). Adapted from QIAGEN's Ingenuity
Pathway Analysis (http://www.qiagen.com/ingenuity).

Extended Data Table 1 | CHD genes with multiple alleles

Gene symbol (mouse)  Gene symbol (human)  Mouse CDS (bp)  Alleles recovered
Daw1                 DAW1                 933             2
Dnai1                DNAI1                2,106           2
Drc1                 DRC1                 2,262           2
Pde2a                PDE2A                2,808           2
Ccdc39               CCDC39               2,814           3
Tmem67               TMEM67               2,988           2
Armc4                ARMC4                3,114           L
Adamts6              ADAMTS6              3,351           i
Pcsk5                PCSK5                5,634           2
Plxnd1               PLXND1               5,778           2
Cep290               CEP290               7,440           2
Pkd1l1               PKD1L1               7,824           2
Megf8                MEGF8                8,370           2
Dnah11               DNAH11               13,467          8
Dnah5                DNAH5                13,866          9

*Identical mutation was recovered from two different mutant lines.
†Identical mutation was recovered from three different mutant lines, later confirmed to be a spontaneous mutation in the support colony.

Extended Data Table 2 | Ciliome mutations in laterality and non-laterality lines

Laterality Mutant Lines (30)

Ciliome Genes: Anks6, Armc4 (m), Bicc1, Cc2d2a, Ccdc151 (m), Ccdc39 (m),
Cep290, Daw1 (m), Dnaaf3 (m), Dnah5 (m), Dnah11 (m), Dnai1 (m), Drc1 (m),
Dync2h1, Dyx1c1 (m), Foxj1 (m), Ift140, Ift74, Nek8, Pkd1l1 (m), Pskh1,
Tbc1d32, Tmem67
Total: 23 (12)

Non-Ciliome Genes: Ap1b1, Cfc1, Cml5, Gm572, Megf8, Mmp21, Pcsk5
Total: 7

Non-Laterality Mutant Lines (31)

Ciliome Genes: Cep110, Dctn5, Fuz, Jbts17, Kif7, Lrp2, Lox, Myh10, Pde2a,
Pkd1, Sufu
Total: 11

Non-Ciliome Genes: Acan, Adamts6, Ap2b1, Cxcr4, Dnm2, Frem2, Hectd1, Lrp1,
Ltbp1, Ndst1, Plxnd1, Prdm1, Prickle1, Ptk7, Robo1, Smad6, Smarca4, Snx17,
Tab1, Zbtb14
Total: 20

m, mutation affects motile cilia function.

Extended Data Table 3 | Mouse CHD genes and associated human diseases

Ciliopathy Related

Armc4, Ccdc39, Dnaaf3, Dnah11, Dnah5, Dnai1, Drc1
Dyx1c1    Primary ciliary dyskinesia, Kartagener syndrome
Cc2d2a
Leber congenital amaurosis 10, Meckel syndrome-4
Jbts17
Joubert syndrome; nephronophthisis-11, Meckel syndrome, COACH syndrome,
Bardet-Biedl syndrome
Anks6     Polycystic kidney disease; nephronophthisis 16
Bicc1     Polycystic kidney disease; cystic renal dysplasia
Tmem67
Nek8      Polycystic kidney disease; nephronophthisis 9
Pkd1      Polycystic kidney disease
Pkd1l1    Kidney disease, diaphanospondylodysostosis
Dync2h1   Short rib polydactyly; asphyxiating thoracic dystrophy
Ift140    Mainzer-Saldino syndrome, short rib thoracic dysplasia

Other Human Diseases

Ap2b1     Ataxia telangiectasia, cerebellar degeneration
Cfc1, Cxcr4, Dnm2, Frem2, Lrp1
Pcsk5, Prdm1, Smad6, Smarca4
Sufu      Medulloblastoma, basal cell nevus syndrome
Ltbp1     Pseudoexfoliation syndrome


©2015 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature14300 


Coordination of mitophagy and mitochondrial 
biogenesis during ageing in C. elegans 


Konstantinos Palikaras1, Eirini Lionaki1 & Nektarios Tavernarakis1,2


Impaired mitochondrial maintenance in disparate cell types is a 
shared hallmark of many human pathologies and ageing. How
mitochondrial biogenesis coordinates with the removal of damaged 
or superfluous mitochondria to maintain cellular homeostasis is 
not well understood. Here we show that mitophagy, a selective type 
of autophagy targeting mitochondria for degradation, interfaces 
with mitochondrial biogenesis to regulate mitochondrial content 
and longevity in Caenorhabditis elegans. We find that DCT-1 is a 
key mediator of mitophagy and longevity assurance under condi- 
tions of stress in C. elegans. Impairment of mitophagy compromises 
stress resistance and triggers mitochondrial retrograde signalling 
through the SKN-1 transcription factor that regulates both mito- 
chondrial biogenesis genes and mitophagy by enhancing DCT-1 
expression. Our findings reveal a homeostatic feedback loop that 
integrates metabolic signals to coordinate the biogenesis and turn- 
over of mitochondria. Uncoupling of these two processes during 
ageing contributes to overproliferation of damaged mitochondria 
and decline of cellular function. 

A common denominator of numerous pathological conditions and
ageing is a deregulation of cellular mitochondrial content. To
uncover the cellular and molecular underpinnings of mitochondrial
mass homeostasis, we examined the involvement of mitochondria-selective
autophagy (mitophagy) in regulating mitochondrial content
of C. elegans during ageing. Mitochondria gradually accrue with age in
wild-type nematodes (Fig. 1a and Extended Data Fig. 1a-d).
Depletion of the Atg6/Vps30/Beclin1 homologue BEC-1, the main
autophagy regulator in worms, recapitulates the effect of ageing on
mitochondrial mass in young adult animals (Fig. 1b). Thus, failure of
autophagy impairs the removal of mitochondria and may contribute to
progressive accumulation of mitochondria during ageing.

To target mitophagy specifically, we knocked down dct-1 (DAF-16/FOXO
Controlled, germline Tumour affecting-1), a putative orthologue to the
mammalian NIX/BNIP3L and BNIP3 (Nip3-like protein X/Bcl-2 and adenovirus
E1B interacting protein; Extended Data Fig. 2a), which act as mitophagy
receptors in mammals. DCT-1 is widely expressed throughout development as
an integral membrane protein and localized on the outer mitochondrial
membrane (Extended Data Fig. 2b-g). Expression of dct-1 is partly
controlled by the FOXO transcription factor DAF-16 and is elevated upon
low insulin/IGF-1 signalling (Extended Data Fig. 1e, f). Similarly to
BEC-1, DCT-1 deficiency increases intestinal and muscle cell
mitochondrial content in young adult animals and distorts mitochondrial
network morphology (Fig. 1c, d and Extended Data Figs 1g-k and 3a-c).
DCT-1 encompasses a WXXL motif and co-localizes with the nematode
Atg8/LC3 autophagosome protein LGG-1 on the outer mitochondrial membrane
(Extended Data Figs 2a and 4a). These findings indicate that DCT-1 is the
nematode orthologue of NIX/BNIP3L and functions as a key regulator of
mitophagy.

To confirm that impairment of mitophagy underlies mitochondrial
accumulation in worms, we developed two composite systems for
monitoring mitophagy in vivo. First, we generated transgenic animals
expressing a mitochondria-targeted Rosella (mtRosella) biosensor that
combines a fast-maturing pH-insensitive DsRed fused to a pH-sensitive
green fluorescent protein (GFP) variant. To validate this tool, we
examined animals carrying mtRosella under normal and mitophagy-inducing
conditions (heat stress, carbonyl cyanide m-chlorophenylhydrazone
(CCCP) and paraquat exposure). All treatments reduce the GFP/DsRed
ratio of Rosella fluorescence, indicating stimulation of mitophagy.
Knockdown of the lgg-1 gene, a core component of the general autophagic
machinery, blocks mitophagy (Extended Data Fig. 4f-h). Additionally, we
generated transgenic animals expressing a mitochondria-targeted GFP,
together with the autophagosomal marker LGG-1 fused with DsRed in body
wall muscle cells, and looked for co-localization under normal and
mitophagy-inducing conditions. Mitophagy-inducing stimuli induce the
formation of autophagosomes that extensively co-localize with mitochondria

Figure 1 | Selective mitochondrial autophagy regulates mitochondrial
content and morphology. Transgenic animals expressing mitochondria-targeted
GFP in the intestine were monitored (a) throughout adulthood for
mitochondrial content. Indicative images are shown on the right for days
1, 3, 5 and 10 (n = 90; ***P < 0.001; one-way analysis of variance
(ANOVA)). b, BEC-1 deficiency increases mitochondrial mass (n = 100;
***P < 0.001; unpaired t-test). c, Knockdown of either DCT-1, PINK-1 or
PDR-1 increases mitochondrial mass (n = 100; ***P < 0.001; one-way ANOVA).
Scale bar, 100 µm. d, Depletion of either DCT-1, PINK-1 or PDR-1 alters
mitochondrial network morphology in body wall muscle cells (n = 120;
***P < 0.001; one-way ANOVA). Scale bar, 20 µm. Anterior, left; posterior,
right. e, Induction of mitophagy is indicated by co-localization of GFP
and DsRed (for each group of images, mitochondria are shown in green on
top, autophagosomes in red below, with a merged image at the bottom).
Scale bars, 20 µm. Images were acquired using ×5 and ×40 objective lenses.
Error bars, s.e.m.

Figure 2 | Mitophagy is required for longevity under conditions of low
insulin/IGF-1 signalling or impaired mitochondrial function.

(Extended Data Fig. 5a, b). Pronounced induction of mitophagy was 
also observed in long-lived daf-2 mutants (Fig. le and Extended Data 
Figs 4f, h and 5b). In all cases, induction of mitophagy is entirely 
dependent on DCT-1 (Fig. le and Extended Data Figs 4f, g and 5a). 
Additionally, we tested the requirement of the mitochondrial phos- 
phatase and tensin (PTEN)-induced kinase 1 (PINK-1) and the cyto- 
solic E3 ubiquitin ligase Parkin, two critical conserved components of 
mitophagy’®. Consistent with their previously described role, PINK-1 
and PDR-1 (the nematode Parkin homologue) deficiency blocks 
mitophagy under stress or low insulin/IGF-1 signalling, and desta- 
bilizes mitochondrial network in wild-type C. elegans cells (Fig. 1c-e 
Figure 3 | Mitophagy deficiency compromises stress resistance and impairs 
mitochondrial function. Mutants dct-1(tm376), pink-1(tm1779), pdr-1(gk448) 
and wild-type animals a, incubated at 37 °C for 7 h (n = 150; ***P < 0.001; 
one-way ANOVA), b, incubated in CCCP for 2 h (n = 150; ***P < 0.001; one-way 
ANOVA), c, exposed to 500 J m⁻² of ultraviolet radiation (n = 150; 
***P < 0.001; one-way ANOVA), or d, treated with 8 mM paraquat (n = 150; 
***P < 0.001; one-way ANOVA). e, Survival after 5 days of starvation (n = 150; 
***P < 0.001; one-way ANOVA). f, ATP levels are decreased in mitophagy mutants (***P < 0.001; 
and Extended Data Figs 1i, j, 4f, g and 5a). Increased mitochondrial 
mass in mitophagy-depleted animals is not a compensatory protect- 
ive mechanism against extensive dysfunctional mitochondria or due 
to enhanced mitochondrial biogenesis, impaired proteasomal activ- 
ity or altered tissue-specific promoter activity of the reporters 
(Extended Data Fig. 6a–h). Furthermore, mitochondrial morphology 
defects upon mitochondrial dynamics impairment are epistatic to 
morphological aberrations occurring upon mitophagy inhibition, 
suggesting the latter are secondary events (Extended Data Fig. 3a–c). 
Depletion of CED-9, the nematode homologue of cell death inhib- 
itor Bcl-2 and physical interactor of DCT-1 (refs 17, 18), causes mito- 
one-way ANOVA). g, Mitophagy-deficient animals display increased 
mitochondrial ROS formation (n = 180; ***P < 0.001; one-way ANOVA). 
h, Mitophagy mutants display decreased ratio of dysfunctional mitochondrial to 
total mitochondrial mass (n = 170; **P < 0.01; one-way ANOVA). i, Oxygen 
consumption rates (***P < 0.001; one-way ANOVA). j, Transgenic animals 
expressing the Ca²⁺ reporter upon knockdown of either dct-1 or pink-1, 
incubated at 37 °C for 3 h, with or without the presence of CCCP (n = 120; 
***P < 0.001, **P < 0.01; one-way ANOVA). Error bars, s.e.m. 
chondrial mass increase without affecting DCT-1 mitochondrial local- 
ization (Extended Data Fig. 7a, b). Interestingly, CED-9 is required for 
mitophagy upon oxidative and mitochondrial stress (Extended Data 
Fig. 7c, d). Simultaneous knockdown of ced-9/dct-1, ced-9/pink-1 
and ced-9/pdr-1 does not further increase mitochondrial content, indi- 
cating that ced-9, dct-1, pink-1 and pdr-1 act in the same genetic path- 
way to regulate mitophagy (Extended Data Fig. 7b). 

To investigate the contribution of mitophagy to longevity, we 
knocked down DCT-1, PINK-1 and PDR-1 in wild-type and long-lived 
mutants. Notably, DCT-1, PINK-1 and PDR-1 impairment functions 
non-redundantly to abrogate elevated autophagy in long-lived daf-2 
mutants and remarkably shortens their lifespan, whereas it does not 
alter basal autophagy or lifespan of wild-type animals (Fig. 2a-c, 
Extended Data Figs 4c-e and 8a-f and Supplementary Table 1). 
Moreover, dct-1 or pink-1 knockdown shortens lifespan extension of 
mutants with moderated mitochondrial dysfunction (isp-1 and clk-1), 
or the inherently caloric-restricted eat-2 mutants (Fig. 2d, Extended 
Data Fig. 8g, h and Supplementary Table 1). We conclude that 
mitophagy is a common longevity assurance process, underlying the 
pro-survival effects of diverse manipulations that extend lifespan. 

To elucidate the mechanism by which mitophagy contributes to 
longevity, we tested mitophagy-depleted animals for stress resistance. 
The dct-1, pink-1 and pdr-1 mutants are substantially more sensitive to 
various stressors (Fig. 3a–e and Extended Data Fig. 9a, b), whereas 
DCT-1 overexpression confers PINK-1- and PDR-1-dependent protection 
against stress. Interestingly, the WXXL motif of DCT-1 is 
required for DCT-1-dependent stress resistance (Extended Data 
Figs 8l and 9g–k). We hypothesized that diminished stress resistance 
may originate from the accumulation of dysfunctional mitochondria 
in mitophagy mutants. Indeed, in mitophagy-deficient animals, 
although mitochondrial DNA (mtDNA) levels remain relatively 
unchanged (Extended Data Fig. 9f), we noted decreased ATP levels, 
elevated mitochondrial reactive oxygen species (ROS) generation, 
mitochondrial membrane depolarization, increased oxygen consumption 
and cytoplasmic Ca²⁺ elevation that becomes exacerbated 
under stress (Fig. 3f-j and Extended Data Fig. 9c-e). These findings, 
combined with the distorted mitochondrial network morphology 
noted above, indicate that mitophagy impairment precipitates pro- 
nounced mitochondrial dysfunction. 

To examine the molecular interaction between DCT-1, PINK-1 and 
PDR-1, we immunoprecipitated DCT-1 upon mitophagy stimulation, 
and analysed tryptic peptides by mass spectrometry. We found that 
DCT-1 is ubiquitinated on lysine 26, a modification that becomes 
enriched under mitophagy-inducing conditions and depends on 
PINK-1 (Extended Data Fig. 9l–o). Additionally, DCT-1 co-localizes 
with PDR-1, indicating that DCT-1 ubiquitination upon mitophagy is 
PDR-1-dependent (Extended Data Fig. 4b). 

Despite the aforementioned defects, mitophagy-depleted animals 
show normal lifespan in the absence of stress stimuli (Fig. 2b-d and 
Extended Data Fig. 8a–h). To resolve this apparent paradox, we considered 
whether accumulation of dysfunctional mitochondria upon 
mitophagy impairment signals a compensatory retrograde response 
that induces expression of nuclear genes encoding mitochondrial pro- 
teins. Accumulation of damaged mitochondria induces oxidative 
stress. In mammalian cells, the nuclear factor-erythroid 2-related fac- 
tor 2 (NRF2/NFE2L2) is activated under oxidative stress conditions 
and orchestrates the expression of several genes promoting sur- 
vival'**°. SKN-1 (SKiNhead-1), the nematode homologue of NRF2, 
was recently found to associate with outer mitochondrial membrane 
proteins”’. SKN-1 transcriptional activity is pivotal for mitohormesis- 
mediated longevity”, and is implicated in the maintenance of 
mitochondrial homeostasis**”®. Interestingly, SKN-1 becomes specif- 
ically activated upon mitophagy inhibition (Fig. 4a and Extended Data 
Fig. 10a–d). SKN-1 is required for the expression of several mitochondrial 
biogenesis genes as well as the mitophagy mediator dct-1 
(Fig. 4b and Extended Data Fig. 10e, h). In addition to DAF-16, 
Figure 4 | Mitophagy deficiency engages the mitochondrial retrograde 
signalling pathway through activation of SKN-1. a, Fluorescence intensity of 
transgenic animals expressing the pgst-4GFP transgene subjected to dct-1, 
pink-1 or pdr-1 knockdown (n = 180; ***P < 0.001; one-way ANOVA). 
Scale bars, 100 μm. b, Messenger RNA (mRNA) levels of atp-5, gas-1, hmg-5, 
tim-17, W09C5.8 and dct-1 in wild-type and SKN-1-depleted animals 
(*P < 0.01; one-way ANOVA). c, mRNA levels of dct-1 in animals with the 
indicated genetic backgrounds (***P < 0.001, *P < 0.01; one-way ANOVA). 
d, Knockdown of SKN-1 alters mitochondrial network morphology in body 
wall muscle cells (n = 120; ***P < 0.001; unpaired t-test). Scale bar, 20 μm. 
e, TMRE staining of wild-type and SKN-1-deficient animals (n = 150; 
***P < 0.001; one-way ANOVA). f, Transgenic animals expressing the Ca²⁺ 
reporter upon knockdown of skn-1 (n = 120; ***P < 0.001; unpaired t-test). 
g, Mitophagy is not activated in SKN-1-deficient animals under low insulin 
signalling conditions (for each group of images mitochondria are shown in 
green on top, autophagosomes in red below, with a merged image at the 
bottom). Scale bars, 20 μm. h, The contribution of the DCT-1 mitophagy 
receptor and SKN-1 to mitochondrial physiology and consequently to cellular 
homeostasis is depicted. DCT-1 is an essential mitophagy mediator that is 
transcriptionally regulated by both DAF-16 and SKN-1. Under normal 
conditions, basal-level mitophagy maintains the mitochondrial pool and 
regulates mitochondrial number, depending on the metabolic state of the cell. 
Mitophagy is triggered under stress conditions to eliminate damaged 
mitochondria and promote cell survival. Concomitant activation of SKN-1 
enhances mitophagy and promotes mitochondrial biogenesis by stimulating 
expression of DCT-1 and several mitochondrial genes. Error bars, s.e.m. Images 
were acquired using ×5 and ×40 objective lenses. 
dct-1 expression is also controlled by SKN-1 (Fig. 4c). These findings 
suggest that SKN-1 may control mitophagy through regulation of dct-1 
transcription. Indeed, skn-1 knockdown impairs mitochondrial net- 
work morphology, causes mitochondrial membrane depolarization 
and increases cytoplasmic Ca²⁺ concentration (Fig. 4d-f and 
Extended Data Fig. 10g), similarly to DCT-1 deficiency. Notably, 
SKN-1 depletion reduces mtDNA content, further highlighting the 
role of SKN-1 in the maintenance of mitochondrial integrity 
(Extended Data Fig. 10f). Furthermore, induction of mitophagy under 
low insulin/IGF-1 signalling or stress is diminished in the absence of 
SKN-1 (Fig. 4g and Extended Data Fig. 10i, j, l). DAF-16 activity is also 
required for mitophagy stimulation. Simultaneous depletion of either 
DAF-16 and DCT-1, or SKN-1 and DCT-1, does not cause a more 
pronounced mitophagy defect (Extended Data Fig. 10j), indicating 
that DAF-16, SKN-1 and DCT-1 are components of a common genetic 
pathway regulating mitophagy. Knockdown of either dct-1 or pink-1 
shortens the lifespan of skn-1 mutants, in contrast to wild-type animals 
(Extended Data Fig. 10k and Supplementary Table 1), indicating 
that SKN-1 activation compensates for the reduced basal mitophagy to 
maintain normal lifespan. Treatment of mitophagy-deficient animals 
with the ROS scavengers N-acetyl-L-cysteine (NAC) and butylated 
hydroxyanisole (BHA)’’** does not change their lifespan, ruling out 
the involvement of mitohormetic feedback signalling (Extended Data 
Fig. 8i, j and Supplementary Table 1). Surprisingly, supplementation of 
ethylene glycol tetraacetic acid (EGTA), a calcium-specific chelating 
agent, shortens the lifespan of mitophagy-deficient worms without 
influencing lifespan of wild-type animals (Extended Data Fig. 8k). 
Furthermore, SKN-1 activation is diminished in mitophagy-depleted 
animals upon EGTA treatment (Extended Data Fig. 10n). Elevated 
cytoplasmic calcium induces mitochondrial biogenesis through cal- 
cium/calmodulin-dependent protein kinase II (CaMKII), p38 mitogen 
activated protein kinase (MAPK), and PGC-1a”**. Indeed, UNC-43 
(the nematode homologue of CaMKII) regulates the transcriptional 
activity of SKN-1 upon mitophagy inhibition, where cytoplasmic calcium 
levels are increased (Extended Data Fig. 10m, o). Our observations 
suggest that cytoplasmic calcium elevation ensuing upon 
mitophagy impairment mediates SKN-1 activation, which overcomes 
the cost of basal mitophagy reduction on longevity. 

Overall, our analysis uncovers a key regulatory mechanism that inter- 
faces mitochondrial biogenesis with mitochondrial clearance through 
selective autophagy, to preserve energy homeostasis in C. elegans. A 
central node of the pathway is the mitophagy mediator DCT-1/NIX, 
which integrates insulin/IGF-1 signals via DAF-16 and SKN-1, in addi- 
tion to other stress signals, relayed through SKN-1, to control mitophagy 
(Fig. 4h). Accumulation of damaged or superfluous mitochondria 
causes oxidative stress, initiating a bipartite retrograde response that is 
mediated by SKN-1 and involves the coordinated induction of both 
mitochondrial biogenesis and mitophagy genes. This closed feedback 
loop preserves mitochondrial quality by neutralizing damaged, and 
generating fresh mitochondria (Fig. 4h). Moreover, coordination of 
biogenesis and turnover of mitochondria allows cells to adjust their 
mitochondrial content in response to physiological demands, stress 
and other intracellular or environmental stimuli. In this context, age- 
related decline of mitophagy both hinders removal of damaged mito- 
chondria and impairs mitochondrial biogenesis to instigate progressive 
mitochondrial accumulation. The tight evolutionary conservation and 
ubiquitous expression of the regulatory factors involved in this closely 
orchestrated response suggest that similar pathways might uphold mito- 
chondrial homeostasis across different taxa. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


No statistical methods were used to predetermine sample size. 
Strains and genetics. We followed standard procedures for C. elegans strain 
maintenance*'. Nematode rearing temperature was kept at 20°C, unless noted 
otherwise. The following strains were used in this study: N2: wild-type Bristol 
isolate, CB1370: daf-2(e1370)III, CB4876: clk-1(e2519)III, MQ887: isp-1(qm150) 
IV, DA465: eat-2(ad465)II, CF1038: daf-16(mu86)I, pink-1(tm1779)II, 
dct-1(tm376)X, VC1024: pdr-1(gk448)III, N2;Ex[pmyo-3INVOM::RFP], SJ4103: 
N2;Is[pmyo-3mtGFP], SJ4143: N2;Is[pges-1mtGFP], CL2166: N2;Is[pgst-4GFP], 
CF1553: N2;Is[psod-3GFP], SJ4005: N2;Is[phsp-4GFP]V, SJ4058: 
N2;Is[phsp-60GFP]V, EU1: skn-1(zu67)IV/nT1(IV;V), EU31: 
skn-1(zu135)IV/nT1(IV;V), EU40: skn-1(zu129)IV/nT1(IV;V). To investigate the 
localization of DCT-1, we examined IR1431: N2;Ex001[pdct-1DCT-1::GFP] and 
IR1607: N2;Is[pmyo-3INVOM::RFP];Ex001[pdct-1DCT-1::GFP] transgenic animals. 
The following strains were used to monitor the mitophagy process: IR1284: 
N2;Is[pmyo-3mtGFP];Ex011[plgg-1DsRed::LGG-1], IR1511: 
N2;Ex001[pmyo-3DsRed::LGG-1;pdct-1DCT-1::GFP] and IR1631: 
N2;Ex003[pmyo-3TOMM-20::Rosella]. To examine mitochondrial morphology and 
mitochondrial mass under mitophagy-deficient conditions, we used the 
SJ4103: N2;Is[pmyo-3mtGFP], IR1298: dct-1(tm376);Is[pmyo-3mtGFP], 
SJ4143: N2;Is[pges-1mtGFP] and IR1505: N2;Ex001[punc-119CTS-1::mCherry] strains. 
To monitor autophagy induction after inhibition of mitophagy, we used the 
DA2123: N2;Is[plgg-1GFP::LGG-1], MAH14: daf-2(e1370);Is[plgg-1GFP::LGG-1] 
and IR621: N2;Ex002[plgg-1DsRed::LGG-1] strains. To monitor DCT-1 and PDR-1 
interaction in vivo, we used the following transgenic animal: IR1739: 
N2;Ex001[pmyo-3PDR-1::DsRed;pdct-1DCT-1::GFP]. To investigate the protective 
role of global expression of dct-1 under mitophagy-inducing conditions, we 
examined the following transgenic animals: IR1431: N2;Ex001[pdct-1DCT-1::GFP], 
IR1487: pink-1(tm1779);Ex001[pdct-1DCT-1::GFP], IR1733: pdr-1(gk448); 
Ex001[pdct-1DCT-1::GFP], IR1666: dct-1(tm376);Ex002[plet-858DCT-1], IR1668: 
dct-1(tm376);Ex001[plet-858DCT-1(ΔMER)], IR1667: dct-1(tm376);Ex003[plet-858 
DCT-1(ΔBH3)], IR1669: dct-1(tm376);Ex003[plet-858DCT-1(ΔWXXL)]. The 
following strain was used for monitoring cytoplasmic calcium levels: IR1155: 
N2;Ex001[plet-858GCaMP2.0]. To monitor the levels of ROS, such as H2O2, in 
mitophagy-deficient animals, we used the JRIS1: N2;Is[prpl-17HyPer] strain. 
Molecular cloning. To generate the pdct-1DCT-1::GFP reporter construct, we 
fused a KpnI fragment, containing the coding sequence of dct-1 and 2 kb upstream 
of the coding sequence, amplified from C. elegans genomic DNA using the primers 
5'-GGATCCCATCTTCTGTACCACGCA-3' and 
5'-GGTACCGCTTGACGATGATGTTTGAT-3', at the amino (N) terminus of GFP, in the 
pPD95.77 plasmid vector. The translational pdct-1DCT-1::GFP fusion construct 
was co-injected with 
pRF4 (contains the rol-6(su1006) dominant transformation marker) into the 
gonads of wild-type animals. To generate the plet-858GCaMP2.0 reporter construct, 
we removed the mec-17 promoter from pmec-17GCaMP2.0 described previously”’, 
and inserted a HindIII/SmaI fragment (derived from the L3786 plasmid vector), 
containing the sequence of the let-858 promoter upstream of GCaMP2.0. The 
resulting plasmid construct was co-injected with pRF4 into the gonads of wild-type 
animals. To generate pmyo-3DsRed::LGG-1, we inserted an AgeI/EcoRI fragment 
(derived from plgg-1DsRed::LGG-1**), containing the coding sequence of DsRed 
downstream of the myo-3 promoter of the pPD96.52 plasmid vector. We then 
fused an EcoRI fragment (derived from plgg-1DsRed::LGG-1**) containing the 
coding sequence of lgg-1 at the carboxy (C) terminus of DsRed of the 
pmyo-3DsRed. The translational pmyo-3DsRed::LGG-1 fusion construct was 
co-injected with the pmyo-2GFP transformation marker. To generate 
pmyo-3TOMM-20::Rosella, we inserted an EcoRI fragment (derived from the mtRosella yeast 
vector’***) containing the coding sequence of the Rosella biosensor in pPD96.52 
under the myo-3 promoter. We then inserted a BamHI fragment containing the 
first two exons of the tomm-20 gene downstream of the myo-3 promoter and upstream 
of the Rosella coding sequence. The tomm-20 fragment was amplified from 
C. elegans genomic DNA using the primers 
5'-GGATCCATGTCGGACACAATTCTTGG-3' and 5'-GGATCCTTCGCGAGTCGACGGC-3'. The translational 
pmyo-3TOMM-20::Rosella fusion construct was co-injected with pRF4 into the 
gonads of wild-type animals. To generate pmyo-3PDR-1::DsRed, we inserted an 
AgeI/BamHI fragment containing the full-length pdr-1 gene without the stop 
codon, amplified from C. elegans genomic DNA using the primers 
5'-GTCGACACTGCAAGCAGGCCCGC-3' and 
5'-ACCGGTGGATTAAACCAATGGTCCCATTG-3', at the N terminus of DsRed, in the pPD96.52 plasmid vector. The 
translational pmyo-3PDR-1::DsRed fusion construct was co-injected with pRF4 into 
the gonads of wild-type animals. To generate punc-119CTS-1::mCherry, we inserted 
an XbaI/XmnI fragment containing the first two exons of the cts-1 gene, amplified from 
C. elegans genomic DNA using the primers 5'-ATGTCGCTCTCTGGAATGGC-3' 
and 5'-GGTGCTACCGTGCTCTGTGC-3', at the N terminus of GFP, in the 
pPD95.77 plasmid vector. We then inserted a HindIII/XbaI fragment, amplified 
from C. elegans genomic DNA using the primers 
5'-CTCTCAAAATTCAGTTTTTTAAACCAC-3' and 5'-ATATGCTGTTGTAGCTGAAAATTTTG-3', 
containing the sequence of the unc-119 promoter upstream of CTS-1::GFP. punc-119CTS-1 
was inserted as a KpnI fragment upstream of mCherry in the pPD95.77 plasmid 
vector. The translational punc-119CTS-1::mCherry fusion construct was co-injected 
with pRF4 into the gonads of wild-type animals. To generate the plet-858DCT-1 rescue 
plasmid, we inserted an XbaI/XhoI fragment containing the dct-1 complementary 
DNA (cDNA) sequence in pPD96.52. The dct-1 fragment was amplified from 
C. elegans cDNA using the primers 5'-TCTAGAATGTCCTCATTTCTTGAGTTTGC-3' 
and 5'-CTCGAGTTGACGATGATGTTTGATTAGTTTC-3'. We 
then removed the myo-3 promoter from pPD96.52 and inserted a HindIII/XmaI 
fragment (derived from the L3786 plasmid vector), containing the let-858 
promoter, upstream of dct-1. The translational plet-858DCT-1 fusion construct was 
co-injected with pRF4 into the gonads of dct-1(tm376) animals. To generate the 
plet-858DCT-1(ΔWXXL) plasmid, which contains a deletion of the WXXL motif, we 
inserted an XbaI/AgeI fragment fused with an AgeI/XhoI fragment of the dct-1 
cDNA sequence in pPD96.52, spanning the sequence that encodes the WXXL 
motif. The dct-1(ΔWXXL) fragment was amplified from C. elegans cDNA using 
the primers 5'-TCTAGAATGTCCTCATTTCTTGAGTTTGC-3' and 
5'-ACCGGTCGATTCTGACATACCAGGTG-3', 5'-ACCGGTGCACCAAGCCGTACCAGTT-3' 
and 5'-CTCGAGTTGACGATGATGTTTGATTAGTTTC-3'. We then 
removed the myo-3 promoter from pPD96.52 and inserted a HindIII/XmaI 
fragment (derived from the L3786 plasmid vector), containing the sequence of the 
let-858 promoter upstream of DCT-1(ΔWXXL). The translational 
plet-858DCT-1(ΔWXXL) fusion construct was co-injected with pRF4 into the gonads of 
dct-1(tm376) animals. To generate the plet-858DCT-1(ΔMER) plasmid, which contains a 
deletion of the MER domain, we inserted an XbaI/AgeI fragment fused with an 
AgeI/XhoI fragment of the dct-1 cDNA sequence in pPD96.52, spanning the sequence 
that encodes the MER domain. The dct-1(ΔMER) fragment was amplified from 
C. elegans cDNA using the primers 5'-TCTAGAATGTCCTCATTTCTTGAGTTTGC-3' 
and 5'-ACCGGTGACGCTACTGCACAAACT-3', 
5'-ACCGGTGTCTCAATCGCACAATCTC-3' and 
5'-CTCGAGTTGACGATGATGTTTGATTAGTTTC-3'. We then removed the myo-3 promoter from pPD96.52 and inserted a 
HindIII/XmaI fragment (derived from the L3786 plasmid vector), containing the 
let-858 promoter upstream of dct-1(ΔMER). The translational 
plet-858DCT-1(ΔMER) fusion construct was co-injected with pRF4 into the gonads of 
dct-1(tm376) animals. To generate the plet-858DCT-1(ΔBH3) plasmid, which contains a 
deletion of the BH3 domain, we inserted an XbaI/AgeI fragment fused with an 
AgeI/XhoI fragment of the dct-1 cDNA sequence in pPD96.52, spanning the sequence that 
encodes the BH3 domain. The dct-1(ΔBH3) fragment was amplified from C. elegans 
cDNA using the primers 5'-TCTAGAATGTCCTCATTTCTTGAGTTTGC-3' 
and 5'-ACCGGTGTACTTTACTTGTTCTAAGCTCTC-3', 
5'-ACCGGTTGGAGCAGTCGGCCA-3' and 
5'-CTCGAGTTGACGATGATGTTTGATTAGTTTC-3'. We then removed the myo-3 promoter from pPD96.52 and inserted a 
HindIII/XmaI fragment (derived from the L3786 plasmid vector), containing 
the let-858 promoter upstream of dct-1(ΔBH3). The translational 
plet-858DCT-1(ΔBH3) fusion construct was co-injected with pRF4 into the gonads of 
dct-1(tm376) animals. For engineering the skn-1, dct-1, pdr-1, ced-9, pbs-5, rpn-6, 
fzo-1, drp-1, isp-1, clk-1 and unc-43 RNAi constructs, gene-specific fragments of 
interest were obtained by PCR amplification directly from C. elegans genomic 
DNA using the following sets of primers 5'-TCAGTTCACCATCGTCCAACA 
CCTC-3’ and 5’-CTGCTGCTCCAACGACAAAGCTG-3’, 5'-TTGCGTCCGG 
CGAGAAGACG-3' and 5’-CTGCTGCTCCAACGACAAAGCTG-3’, 5'-GTCG 
ACACTGCAAGCAGGCCCGC-3’ and 5'-ACCGGTGGATTAAACCAATGGT 
CCCATTG-3', 5'-ATGACACGCTGCACGGCGG-3' and 5'-CTTCAAGCTGA 
ACATCATCCGCCC-3', 5'-ACCGGTATGTGGGGCGAGACATTCG-3' and 
5'-GGGCCCACGTCATCAACACCCAGCC-3’, 5'-GAAACGAGTTCACGAG 
AAGACACTA-3' and 5'-GGAAGGTCAAGCATCACCTGAA-3’, 5'-ATTCG 
CGTCGTGCTGCCG-3’ and 5'-ATGGCGTTGGCGGAGAGTCC-3’, 5’-ATG 
GAAAATCTCATTCCTGTCGTC-3’ and 5’-CGCTCTTTCTCCGGTTGATG- 
3', 5'-ATGGCTTCTCTTGCTAGATCTGGAGG-3’ and 5'-AGTGAGATCCG 
TGACAAGGGCAG-3’, 5’-ATGTTCCGTGTAATAACCCGTG-3’ and 5’-TCT 
CAGCAATCGCAATAGCTC-3’, 5’-CCAAGTTTAGTGACAATTACGATGT 
G-3' and 5'-CTTTCTTGAGAACTTCTGGCGACA-3', respectively. The 
PCR-generated fragments were subcloned into the pL4440 plasmid vector. The resulting 
constructs were transformed into HT115(DE3) Escherichia coli bacteria deficient 
for RNase III. Bacteria carrying an empty vector were used in control experiments. 
For skn-1, pbs-5 and rpn-6 RNAi, the effect can be severe, leading to embryonic 
lethality of the animals. 

mRNA quantification. To quantify dct-1, atp-5, hmg-5, gas-1, tim-17, W09C5.8, 
pink-1, pdr-1, ges-1, myo-3 and unc-119 mRNA levels, total RNA was extracted 
using the TRIzol reagent (Invitrogen). The following sets of primers were used, 
respectively: 5’-GGCTCCAACCTTACCACTCC-3’ and 5'-GCAAATCCTACT 
GCTGCTCC-3’, 5’-ACTGGTCGAAGCTCGCCGAG-3' and 5’-ACTCGGCT 
GGAACCTCTCCG-3’, 5'-CGTCCAAGTGTTCCTCCAAGTG-3’ and 5’-CTT 
CGCTTCGTCTGTGTACTTCTTT-3’, 5'-CTGGATGTTCGAGGAACGTG-3’ 
and 5’-AGCTGACACAAGTCCGATGTC-3’, 5'-TGCGAGAAGTACGAATG 
AGAAGCAC-3' and 5'-CGAGACGAATCCTTTCCATCTTTGG-3’, 5'-ATGC 
TGCCACGTTTGGCTCTCC-3’ and 5'-ACGGAAGCTGTAGCGGTACAAGA 
GC-3', 5'-AGCATATCGAATCGCAAATGAGTTAG-3’ and 5'-TCGACCGTG 
GCGAGTTACAAG-3’, 5’-AGCCACCGAGCGATTGATTGC-3’ and 5'-GTG 
GCATTTTGGGCATCTTCTTG-3’, 5’-TCGCCAAGAGGTATGCTTCACAA 
G-3' and 5'-TGCTGCTCCTGCACTGTATCCC-3’, 5’-AGCCAGACGGTGAG 
GAAGATGC-3’ and 5’-TGCGTCAAGGGTCTTGTTACATCTGG-3’, 5’-AAC 
AACAACAATCGATCGCACC-3’ and 5'-CACGTGTTCCGTGTCGAGGTC- 
3’. Results were normalized to genomic DNA using the following primers specific 
for ama-1: 5'-CCTACCTACACTCCAAGTCCATCG-3’ and 5'-GGTGAAGCT 
GGCGAATACGTTG-3’ and pmp-3: 5’-ATGATAAATCAGCGTCCCGAC-3’ 
and 5’-TTGCAACGAGAGCAACTGAAC-3’. For cDNA synthesis, mRNA was 
reverse transcribed using an iScript™ cDNA Synthesis Kit (Bio-Rad) and 
PrimeScript™ Reverse Transcriptase (Takara). Quantitative PCR was performed 
in triplicate using a Bio-Rad CFX96 Real-Time PCR system (Bio-Rad). 
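
As an illustration of how triplicate quantitative PCR readings could be converted into relative mRNA levels, the following minimal sketch applies the widely used 2^(-ΔΔCt) calculation with two reference genes (ama-1 and pmp-3, as named above). The text does not state which quantification model was applied, so the ΔΔCt approach, the assumption of 100% amplification efficiency, the function names and all Ct values below are assumptions made for illustration only.

```python
from statistics import mean


def delta_ct(target_cts, reference_cts_by_gene):
    """Mean target Ct minus the average of the mean reference-gene Cts.

    Averaging Ct values across reference genes is equivalent to using the
    geometric mean of their expression levels.
    """
    reference_ct = mean(mean(cts) for cts in reference_cts_by_gene.values())
    return mean(target_cts) - reference_ct


def relative_expression(sample, control):
    """Fold change of sample versus control by the 2^(-ddCt) method.

    Assumes one doubling per cycle (100% amplification efficiency).
    """
    ddct = delta_ct(*sample) - delta_ct(*control)
    return 2.0 ** (-ddct)


# Hypothetical triplicate Ct values (not data from this study).
wild_type = ([24.0, 24.1, 23.9],
             {"ama-1": [20.0, 20.1, 19.9], "pmp-3": [22.0, 22.0, 22.0]})
knockdown = ([26.0, 26.1, 25.9],
             {"ama-1": [20.0, 20.0, 20.0], "pmp-3": [22.0, 22.1, 21.9]})

fold_change = relative_expression(knockdown, wild_type)
print(f"relative expression: {fold_change:.2f}")
```

With these illustrative numbers the target transcript is two cycles later in the knockdown sample after normalization, which corresponds to roughly a quarter of the control level.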

Ca²⁺ monitoring. For intracellular Ca²⁺ monitoring experiments, transgenic 
animals expressing the Ca²⁺ reporter GCaMP2.0 globally were examined under 
a Zeiss AxioImager Z2 epifluorescence microscope. L4 larvae were grown on RNAi 
plates with or without the presence of 15 μM CCCP. RNAi-treated animals against 
dct-1 and pink-1 were monitored on day 4 of adulthood. For heat stress assay, L4 
larvae were grown on RNAi plates. On day 4 of adulthood, worms were monitored 
after 3 h of incubation at 37 °C. Worms were immobilized with levamisole before 
mounting on 2% agarose pads for microscopic examination with a Zeiss 
AxioImager Z2 epifluorescence microscope. Images were acquired under the same 
exposure. Average pixel intensity values were calculated by sampling images of 
different animals. We calculated the mean and maximum pixel intensity for each 
animal in these images using the ImageJ software (http://rsb.info.nih.gov/ij/). 
Mean values were compared using unpaired t-tests. For multiple comparisons, 
we used one-factor analysis of variance (ANOVA) corrected by the post hoc 
Bonferroni test. For each experiment, at least 100 animals were examined for each 
strain/condition. Each assay was repeated at least three times. We used the Prism 
software package (GraphPad Software) for statistical analyses. 
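
To make the statistical procedure concrete, the sketch below computes a one-way ANOVA F statistic from per-animal mean pixel intensities and applies a Bonferroni adjustment to a set of raw P values. It is not the Prism workflow used in this study: the function names and the intensity values are hypothetical, and the P value for the F statistic would still need to be obtained from the F distribution with a statistics package.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of numeric groups."""
    all_values = [value for group in groups for value in group]
    grand_mean = mean(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual values around their own group mean.
    ss_within = sum((value - mean(g)) ** 2 for g in groups for value in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


def bonferroni(p_values):
    """Multiply each raw P value by the number of comparisons, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical mean pixel intensities for three conditions.
control = [1.0, 2.0, 3.0]
dct1_rnai = [2.0, 3.0, 4.0]
pink1_rnai = [5.0, 6.0, 7.0]

f_stat, df_b, df_w = one_way_anova_f([control, dct1_rnai, pink1_rnai])
adjusted = bonferroni([0.01, 0.04, 0.5])
print(f_stat, df_b, df_w, adjusted)
```

The Bonferroni step is deliberately conservative: with three pairwise comparisons, a raw P value must fall below 0.05/3 to remain significant at the 0.05 level.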

Mitochondrial imaging. For MitoTracker staining, 500 μl heat-inactivated OP50 
(65 °C, 30 min) were mixed with 100 μl MitoTracker Red FM (catalogue number 
M-22425; Molecular Probes, Invitrogen) stock solution (100 μM) and seeded on 
agar plates. Animals were incubated for 2 days with the compound. Stained and 
washed worms were immobilized with levamisole before mounting on 2% agarose 
pads for microscopic examination with a Zeiss AxioImager Z2 epifluorescence 
microscope. For the quantification of mitochondrial ROS, 500 μl heat-inactivated 
OP50 (65 °C, 30 min) were mixed with 100 μl MitoTracker Red CM-H2X ROS 
(catalogue number M-7513; Molecular Probes, Invitrogen) stock solution 
(100 μM) and seeded on agar plates. Animals were incubated for 2 days with 
the corresponding compound. Stained and washed worms were immobilized with 
levamisole before mounting on 2% agarose pads for microscopic examination 
with a Zeiss AxioImager Z2 epifluorescence microscope. For TMRE staining 
(tetramethylrhodamine, ethyl ester, perchlorate; a dye that accumulates in intact, 
respiring mitochondria), 1-day-adult animals were grown at 20 °C or 25 °C in the 
presence of 150 nM TMRE for 24 h. Stained and washed worms were immobilized 
with levamisole before mounting on 2% agarose pads for microscopic examination 
with a Zeiss AxioImager Z2 epifluorescence microscope. Images were acquired 
under the same exposure. Average pixel intensity values were calculated by 
sampling images of different animals. We calculated the mean and maximum pixel 
intensity for each animal in these images using the ImageJ software 
(http://rsb.info.nih.gov/ij/). For each experiment, at least 150 animals were examined for each 
strain/condition. Each assay was repeated at least three times. We used the Prism 
software package (GraphPad Software) for statistical analyses. 

Stress assays. To evaluate thermotolerance (heat stress assay), 4-day-old adult 
hermaphrodites were placed on pre-warmed (37 °C) NGM plates and incubated at 
37 °C. After incubation for 7 h, plates were removed; worms were recovered at 
20 °C for 1 h. Then, worms were scored for motility, provoked movement and 
pharyngeal pumping. Worms failing to display any of these traits were scored as 
dead. Three distinct populations of 150 adults (for each strain) were scored repeat- 
edly over the assay period. For CCCP stress assays, 4-day-old adult hermaphro- 
dites were incubated for 2 h at 20°C in M9 buffer containing CCCP at a final 
concentration of 15 μM. Animals were scored for survival after a 1-h recovery 
period at 20°C. Three distinct populations of 150 adults (for each strain) were 
scored repeatedly over the assay period. For ultraviolet stress assays, 4-day-old 
adult hermaphrodites were exposed to 500 J m⁻² ultraviolet irradiation using an 
ultraviolet crosslinker (BIO-LINK-BLX-E365, Vilber Lourmat). Three days after 
exposure, animals were scored for survival. Three distinct populations of 150 
adults (for each strain) were scored repeatedly over the assay period. For oxidative 
stress assays, 7-day-old adult hermaphrodites were exposed to paraquat at a final 
concentration of 8 mM. Animals were scored for survival after 2 days at 20 °C. 
Three distinct populations of 150 adults (for each strain) were scored repeatedly 
over the assay period. For the starvation assay, worm eggs were collected upon 
hypochlorite treatment of gravid adults, resuspended in M9 buffer and incubated 
at 20°C. Newly hatched worms were arrested in L1 larval stage owing to lack of 
food. Survival under starvation was assessed every 2 days by taking aliquots and 
scoring live animals. Three distinct populations (for each strain) were scored 
repeatedly over the assay period. We used the Prism software package 
(GraphPad Software) for statistical analyses. 
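
The survival scoring described above can be summarized as a mean percentage with its standard error across the three replicate populations, matching the s.e.m. error bars reported in the figures. The following is a minimal sketch under that reading; the function name and the counts of surviving animals are hypothetical and are not data from this study.

```python
from math import sqrt
from statistics import mean, stdev


def survival_summary(alive_counts, population_size=150):
    """Mean percentage survival and s.e.m. across replicate populations."""
    percentages = [100.0 * alive / population_size for alive in alive_counts]
    # s.e.m. = sample standard deviation divided by sqrt(number of replicates).
    sem = stdev(percentages) / sqrt(len(percentages))
    return mean(percentages), sem


# Hypothetical survivors in three populations of 150 adults each.
mean_pct, sem_pct = survival_summary([90, 75, 60])
print(f"survival: {mean_pct:.1f}% +/- {sem_pct:.1f}% (s.e.m., n = 3)")
```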

ATP measurements. To determine ATP content, 50 4-day-old adult hermaphrodites 
were collected in 50 ml of M9 buffer and frozen at −80 °C. Frozen worms 
were immersed in boiling water for 15 min, cooled and centrifuged to pellet 
insoluble debris. The supernatant was moved to a fresh tube and diluted tenfold 
before measurement. ATP content was determined by using the Roche ATP 
bioluminescent assay kit HSII (Roche Applied Science) and a TD-20/20 lumin- 
ometer (Turner Designs). ATP levels were normalized to total protein content. We 
used the Prism software package (GraphPad Software) for statistical analyses. 
Mitochondrial DNA quantification. mtDNA was quantified using quantitative 
real time PCR as described previously**. We used the primers 5’-GTTTATGCT 
GCTGTAGCGTG-3' and 5'-CTGTTAAAGCAAGTGGACGAG-3' (Mito1 set) 
for mtDNA. The results were normalized to genomic DNA using the following 
primers specific for ama-1: 5'-TGGAACTCTGGAGTCACACC-3' and 
5'-CATCCTCCTTCATTGAACGG-3'. Quantitative PCR was performed using a Bio-Rad 
CFX96 Real-Time PCR system, and was repeated three times. 

Oxygen consumption rate measurements. Oxygen consumption rates were 
measured as previously described*”** using a Clark-type electrode with some 
minor modifications (Hansatech Instruments). Young adult worms were washed 
and collected in M9 buffer. Approximately 100 μl of slurry pellet of worms were 
delivered into the chamber in 1 ml of M9 buffer. The chamber was kept at 20 °C, 
and measurements were done for 10-15 min, depending on the oxygen consump- 
tion rate. The slope of the straight portion of the plot was used to derive the oxygen 
consumption rate. Worms were recovered after respiration measurements and 
collected for protein quantification. Rates were normalized to protein content. We 
performed three independent measurements per strain. We used the Prism soft- 
ware package (GraphPad Software) for statistical analysis. 
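
Deriving a rate from the straight portion of the electrode trace amounts to fitting a line to that portion and normalizing its slope to protein content. The sketch below shows one way this could be done by ordinary least squares; the choice of fitting window, the units (nmol O2, min, mg protein), the function names and the trace values are all assumptions for illustration, not the procedure or data of this study.

```python
def least_squares_slope(times, values):
    """Slope of the ordinary least-squares line through (time, value) points."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    numerator = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    denominator = sum((t - mean_t) ** 2 for t in times)
    return numerator / denominator


def oxygen_consumption_rate(times_min, oxygen_nmol, protein_mg, window):
    """Oxygen consumed per minute per mg protein within a linear time window."""
    start, end = window
    points = [(t, o) for t, o in zip(times_min, oxygen_nmol) if start <= t <= end]
    slope = least_squares_slope([t for t, _ in points], [o for _, o in points])
    # Oxygen in the chamber falls over time, so consumption is the negative slope.
    return -slope / protein_mg


# Hypothetical trace: a short lag phase followed by a linear decline.
times = [0, 1, 2, 3, 4, 5]
oxygen = [250.0, 249.0, 240.0, 230.0, 220.0, 210.0]

rate = oxygen_consumption_rate(times, oxygen, protein_mg=0.5, window=(2, 5))
print(f"{rate:.1f} nmol O2 per min per mg protein")
```

Restricting the fit to the linear window keeps the initial equilibration phase from biasing the slope.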

Western blotting. Protein samples were analysed by 8% Tricine-SDS-polyacryla- 
mide gel electrophoresis (SDS-PAGE), transferred on nitrocellulose membrane 
and blotted against various antibodies. The antibodies used in the study were 
anti-GFP (rabbit polyclonal, Minotech Biotechnology), anti-HSP-60 (mouse 
monoclonal-Developmental Studies Hybridoma Bank) and anti--tubulin (mouse 
monoclonal-Abcam). For total worm protein extraction, protein samples were 
produced by directly boiling worms in 2× Laemmli sample buffer with 
β-mercaptoethanol supplemented with 5 mM PMSF and complete mini proteinase inhibitor 
cocktail (Roche). 

Mitochondria isolation. Mixed populations of worms were incubated in 10 mM 
DTT in M9 for 30 min at 4 °C with rotation. DTT was washed off with three 
successive centrifugations and dilutions in M9. Worm bodies were homogenized 
by hand with a 3 ml Potter-Elvehjem homogenizer with PTFE pestle and glass tube 
(Sigma-Aldrich) in isolation buffer (50 mM Tris-HCl pH 7.4, 210 mM mannitol, 
70 mM sucrose, 0.1 mM EDTA, 2 mM PMSF, and complete mini protease inhibitor
cocktail (Roche), the last two being added just before use). The progress of
homogenization was monitored by visual inspection under a stereoscope. The 
suspension was centrifuged at 200g for 1 min and the supernatant fraction kept 
separately while the pellet was resuspended in isolation buffer and homogenized 
as previously. The homogenate was centrifuged again at 200g for 1 min. 
Supernatants were combined and centrifuged once more at low speed (200g) for 
1 min and then at high speed (12,000g). The pellet was resuspended in isolation 
buffer and centrifuged at 200g for 1 min. An aliquot of the supernatant was kept 
separately for protein concentration determination. High-speed centrifugation 
followed, and mitochondrial pellets were used directly for proteinase K treatment 
and carbonate extraction and/or SDS-PAGE. 
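
Because the centrifugation steps are given as relative centrifugal force (200g, 12,000g), reproducing them on a particular rotor requires converting to rotor speed. The sketch below uses the standard relation RCF = 1.118 × 10⁻⁵ × r × rpm², with r in cm. The 8.0 cm radius is a hypothetical value for illustration, not a parameter from the text.

```python
import math

RCF_CONSTANT = 1.118e-5  # relates RCF to rotor radius (cm) and speed (rpm)


def rcf_to_rpm(rcf_g, radius_cm):
    """Rotor speed in rpm needed to reach a given RCF at a given radius."""
    return math.sqrt(rcf_g / (RCF_CONSTANT * radius_cm))


def rpm_to_rcf(rpm, radius_cm):
    """RCF (in multiples of g) produced at a given rotor speed and radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


# Hypothetical rotor radius of 8.0 cm; replace with the value for the rotor in use.
for rcf in (200, 12000):
    print(rcf, "g ->", round(rcf_to_rpm(rcf, radius_cm=8.0)), "rpm")
```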

Proteinase K treatment. Fifty micrograms of crude mitochondria from mixed 
worm population were digested with 100 µg ml⁻¹ proteinase K (Sigma-Aldrich) in
isolation buffer minus protease inhibitors for 30 min. The reaction was inhibited
with 5 mM PMSF for 10 min on ice. Proteinase K-shaved mitochondria were
analysed by SDS-PAGE and western blotting. 

Sodium carbonate extraction. Fifty micrograms of crude mitochondria from 
mixed worm population were diluted in 0.1 M sodium carbonate (pH 11.5), and 
incubated on ice for 30 min. The suspension was centrifuged for 30 min at 55,000g 
in a Beckman Optima TLX-120 ultracentrifuge. Soluble and membrane fractions
were analysed by western blotting. 

Immunoprecipitation. For immunoprecipitation, mixed populations of worms
were treated for 3 h with 10 µM MG132 (Sigma-Aldrich) in M9 buffer and
subsequently with 10 µM MG132 and 60 mM paraquat (methyl viologen dichloride,
Sigma-Aldrich) in M9 buffer. Worm lysis was performed by hand with a pellet pestle
(Sigma-Aldrich) in lysis buffer (50 mM HEPES-KOH, pH 7.5, 150 mM NaCl,
1 mM EDTA, 1 mM EGTA, 10% glycerol, 1% Triton X-100, 1% NP-40, 0.1%
SDS) supplemented with 1 mM DTT, 5 mM PMSF (Sigma-Aldrich), protease
and phosphatase inhibitor cocktails (cOmplete mini protease inhibitor cocktail
tablets (Roche) and phosphatase inhibitor cocktail (Cell Signaling)), 20 µM NEM
(Sigma-Aldrich) and 10 µM MG132 just before use. The lysate was incubated
on ice for 30 min and then centrifuged for 15 min at 16,000g at 4 °C.
Supernatant was diluted three times in dilution buffer (50 mM HEPES-KOH,
pH 7.5, 150 mM NaCl, 1 mM EDTA, 1 mM EGTA, 10% glycerol) supplemented
with 5 mM PMSF (Sigma-Aldrich), protease and phosphatase inhibitor cocktails
(cOmplete mini protease inhibitor cocktail tablets (Roche) and phosphatase
inhibitor cocktail (Cell Signaling)), 20 µM NEM (Sigma-Aldrich) and 10 µM MG132
just before use. An equal amount of protein from each sample was incubated for 4 h at
4 °C with Sepharose-Protein G beads (Amersham-GE Healthcare) pre-loaded
with rabbit anti-GFP antibody (Minotech Biotechnology), cross-linked onto protein
G with BS3 cross-linker (Thermo Scientific). The immunoprecipitated proteins
were digested by trypsin for mass spectrometry analysis.

Sample preparation and mass spectrometry analysis. Sepharose beads carrying 
the immunoprecipitated material were washed twice in 50 mM ammonium bicarbonate
buffer. Subsequently, 1 µg of trypsin (sequencing grade; Roche Applied
Science) was added per sample and digestion was performed on beads for 14 h at
37 °C. Tryptic peptides were dried and analysed by nanoflow liquid chromato- 
graphy-electrospray ionization with tandem mass spectrometry (nLC-ESI-MS/ 
MS) on an EASY-nLC system (Thermo Scientific, software version 2.7.6 #1) 
coupled with an LTQ-Orbitrap XL ETD (Thermo Scientific) through a nES ion 
source (Thermo Scientific). Data were acquired with Xcalibur software (LTQ 
Tune 2.5.5 spl, Thermo Scientific). The nLC-MS/MS raw data were loaded in 
Proteome Discoverer 1.3.0.339 (Thermo Scientific) and run using both Mascot 
2.3.02 (Matrix Science) and Sequest (Thermo Scientific) search algorithms against 
the C. elegans theoretical proteome (last modified 11 June 2014, version 108). Mass 
spectrometry was performed in-house in the Proteomics Facility at the Institute of
Molecular Biology and Biotechnology, Foundation for Research and Technology - 
Hellas (http://profi.imbb.forth.gr). 

Lifespan assays. Lifespan assays were performed at 20 °C unless noted otherwise. 
Synchronous animal populations were generated by hypochlorite treatment of 
gravid adults to obtain tightly synchronized embryos that were allowed to develop 
into adulthood under appropriate, defined conditions. The antioxidant NAC was 
administered at a final concentration of 10 mM. NAC was added into NGM 
media from a concentrated stock solution (500 mM) before plate pouring. The 
antioxidant butylated hydroxyanisole (BHA) was administered at a final concentration
of 25 µM. BHA was added into NGM media from a concentrated stock
solution (50 mM) before plate pouring. Ethidium bromide (EtBr) was administered
at a final concentration of 5 µg ml⁻¹. EtBr was added into NGM media from
a concentrated stock solution (10 mg ml⁻¹) before plate pouring. Ethylene glycol
tetraacetic acid (EGTA) was administered at a final concentration of 10 mM.
EGTA was diluted from a concentrated stock solution (0.5 M) in 150 µl of
sterilized water and applied to the top of the agar medium.
Plates were then gently swirled to allow the drug to spread to the entire NGM
surface. Identical drug-free water solutions were used for the control plates. For
RNAi lifespan experiments, worms were placed on NGM plates containing 2 mM
IPTG and seeded with HT115(DE3) bacteria transformed with either the pL4440 
vector or the test RNAi construct. Progeny were grown at 20 °C unless noted
otherwise, through the L4 larval stage and then transferred to fresh plates in groups 
of 10-20 worms per plate for a total of 100-150 individuals per experiment. The 
day of egg collection and initiation of RNAi was used as t = 0. Animals were
transferred to fresh plates every 2-4 days thereafter and examined every day for 
touch-provoked movement and pharyngeal pumping, until death. Worms that 
died owing to internally hatched eggs, an extruded gonad or desiccation due to 
crawling on the edge of the plates were censored and incorporated as such into 
the data set. Each survival assay was repeated at least twice and figures
represent typical assays. Survival curves were created using the product-limit
method of Kaplan and Meier. The log-rank (Mantel–Cox) test was used to evaluate
differences between survivals and determine P values. We used the Prism
software package (GraphPad Software) for statistical analysis and to determine
lifespan values.
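
The product-limit (Kaplan–Meier) estimate with censored animals can be sketched in a few lines. The analysis in the text was done in Prism, so the code below is only an illustration of the estimator itself; the function names and the example data are assumptions, and the log-rank test is not included.

```python
def kaplan_meier(times, observed):
    """Product-limit survival estimate.

    times: day of death or censoring for each animal.
    observed: True if the animal died, False if it was censored
              (for example, lost to internal hatching or desiccation).
    Returns a list of (day, fraction_surviving) at each day with a death.
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for day in sorted(set(times)):
        deaths = sum(1 for t, obs in zip(times, observed) if t == day and obs)
        leaving = sum(1 for t in times if t == day)
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((day, survival))
        # Censored animals leave the risk set without changing survival.
        at_risk -= leaving
    return curve


def median_lifespan(curve):
    """First day on which the estimated survival is 50% or lower."""
    for day, fraction in curve:
        if fraction <= 0.5:
            return day
    return None


# Hypothetical data: five animals, one censored on day 12.
days = [10, 12, 12, 15, 18]
died = [True, True, False, True, True]
curve = kaplan_meier(days, died)
print(curve)
print("median lifespan (days):", median_lifespan(curve))
```

Censored animals contribute to the number at risk up to the day they are removed, which is why they are kept in the data set rather than discarded.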
Extended Data Figure 1 | Mitochondrial accumulation during ageing and
upon mitophagy depletion. a, Wild-type animals stained with MitoTracker
Red FM dye were monitored throughout adulthood for mitochondrial content.
Indicative images are shown on the right for days 1, 3, 5 and 10 (n = 90;
***P < 0.001; one-way ANOVA). b, Total worm lysates from transgenic
animals expressing a mitochondria-tagged GFP in the intestine on day 1, 3, 5 
and 10 of adulthood were loaded on Tricine-SDS-PAGE and immunoblotted 
against the indicated proteins. c, Transgenic animals expressing mitochondria- 
targeted GFP in the intestine were monitored throughout adulthood for 
mitochondrial content. (n = 100; ***P < 0.001; one-way ANOVA). 

d, Transgenic animals expressing mitochondria-targeted mCherry in neurons 
were monitored throughout adulthood for mitochondrial content. Indicative 
images are shown on the right for days 1, 3, 5 and 10 (n = 70; ***P < 0.001; 
one-way ANOVA). e, mRNA levels of dct-1 in wild-type animals, dct-1(tm376) 
mutants, dct-1(RNAi) animals, daf-2(e1370) and daf-16(mu86) mutants. 
Expression of dct-1 is transcriptionally regulated by DAF-16 (***P < 0.001; 
one-way ANOVA). f, Efficacy of dct-1 silencing by RNAi. Transgenic animals
expressing a full-length pdct-1DCT-1::GFP reporter fusion subjected to dct-1
RNAi. g, mRNA levels of ges-1 and myo-3 in wild-type animals, dct-1(RNAi)
and pink-1(RNAi) animals. Expression of ges-1 and myo-3 is not changed upon
mitophagy inhibition (NS, P > 0.05; one-way ANOVA). h, Mutant dct-1(tm376)
animals expressing mitochondria-targeted GFP in body wall muscle
cells display increased mitochondrial mass (n = 100; ***P < 0.0001; unpaired
t-test). Scale bars, 100 µm. Images were acquired using a ×5 objective lens.
i, Knockdown of either DCT-1 or PINK-1 increases mitochondrial mass in
worms that express a mitochondria-targeted GFP in body wall muscles
(n = 120; ***P < 0.001; one-way ANOVA). Anterior, left; posterior, right.
j, Wild-type animals and dct-1(tm376), pink-1(tm1779) and
pdr-1(gk448) mutants stained with MitoTracker Red FM dye were monitored for
mitochondrial content (n = 120; ***P < 0.001; one-way ANOVA). Scale bars,
20 µm. Images were acquired using a ×10 objective lens. k, Mitochondrial
network morphology is altered in dct-1(tm376) mutants. The mitochondrial
network in wild-type animals is well-organized and runs parallel with the
myofilament lattice. By contrast, dct-1(tm376) mutant animals display
fragmented and disorganized mitochondrial network morphology. Scale bar,
20 µm. Images were acquired using a ×40 objective lens. Error bars, s.e.m.
Extended Data Figure 2 | DCT-1 expression and sub-cellular localization.
a, DCT-1 is the homologue of the mammalian BNIP3 and NIX/BNIP3L in C. 
elegans. Alignment of C. elegans DCT-1, M. musculus BNIP3 and M. musculus 
NIX/BNIP3L proteins. The predicted transmembrane domain of DCT-1, 
BNIP3 and NIX/BNIP3L is depicted in green. The conserved WXXL motif that 
is essential for interaction with the autophagosomal protein LC3/Atg8 is shown 
in red. The conserved MER and BH3 domains are depicted in blue and yellow, 
respectively. b, Spatiotemporal expression and sub-cellular localization of
DCT-1. Images of transgenic animals expressing a full-length pdct-1DCT-1::GFP
reporter fusion. Expression of dct-1 occurs in all somatic tissues of adult
animals, including neurons, the pharynx, the intestine, body wall muscles and
vulva muscles (indicated by the arrows). Scale bars, 50 µm. c, Expression is
detectable in embryos and remains high during all postembryonic
developmental stages, throughout adulthood. Scale bars, 50 µm. Images were
acquired using ×10 and ×40 objective lenses. d, e, DCT-1::GFP co-localizes
with mitochondria in different tissues. Transgenic animal expressing a full-length
pdct-1DCT-1::GFP reporter fusion localizing in mitochondria. Animals
were stained with MitoTracker Red FM (Invitrogen, Molecular Probes). Scale
bars, 20 µm. Images were acquired using a ×100 objective lens. f, Transgenic
animal co-expressing a full-length pdct-1DCT-1::GFP reporter fusion together
with the pmyo-3INVOM::RFP outer mitochondrial membrane reporter. DCT-1::GFP
co-localizes with INVOM::RFP, demonstrating that DCT-1 is an outer
mitochondrial membrane protein. Scale bars, 10 µm and 5 µm. Images were
acquired using a ×100 objective lens. g, Isolated mitochondria from transgenic
animals expressing a full-length pdct-1DCT-1::GFP reporter fusion were
untreated (lane 1, M), treated with proteinase K (lane 2, MPS), or treated with 
sodium carbonate (lanes 3 and 4) to separate soluble from membrane material. 
DCT-1::GFP is accessible and degraded by proteinase K while HSP-60, a 
mitochondrial matrix protein, is protected. Soluble (lane 3, S) and membrane 
(lane 4, P) fractions after carbonate extraction are separated by 
ultracentrifugation. DCT-1::GFP is detected in the membrane fraction (P) 
while soluble HSP-60 protein is found in the soluble material, indicating that 
DCT-1 is an integral membrane protein. 


©2015 Macmillan Publishers Limited. All rights reserved 


[Extended Data Figure 3 panel labels: fzo-1(RNAi); dct-1(RNAi); fzo-1(RNAi);dct-1(RNAi); pink-1(RNAi); pink-1(RNAi);fzo-1(RNAi); pdr-1(RNAi); fzo-1(RNAi);pdr-1(RNAi)]


Extended Data Figure 3 | Mitophagy inhibition does not alter intestinal or 
muscle cell mitochondrial network morphology of FZO-1- and DRP-1- 
depleted animals. a, Mitophagy inhibition by RNAi against dct-1, pink-1 and
pdr-1 does not affect mitochondrial network morphology in FZO-1- and
DRP-1-depleted animals that express a mitochondria-targeted GFP in the intestine.
The mitochondrial network in wild-type animals is well-organized and highly 
connected. By contrast, animals subjected to RNAi against dct-1, pink-1, pdr-1 
and fzo-1 display fragmented and disorganized mitochondrial network 
morphology. Simultaneous knocking down of fzo-1/dct-1, pink-1/fzo-1 and
fzo-1/pdr-1 does not alter the already fragmented mitochondrial network of
FZO-1-deficient worms. DRP-1-depleted animals display disorganized aggregated
and globular mitochondria compared with wild-type worms. Simultaneous 
knocking down of drp-1/dct-1, pink-1/drp-1 and pdr-1/drp-1 does not change 
the impaired mitochondrial network of DRP-1-depleted animals. Scale bar, 
20 µm. Images were acquired using a ×40 objective lens. b, Transgenic animals
expressing a mitochondria-targeted GFP in muscle cells subjected to RNAi 
against dct-1, pink-1, pdr-1 and fzo-1. The mitochondrial network in wild-type 
animals is well organized and runs parallel with the myofilament lattice. By 
contrast, DCT-1, PINK-1, PDR-1 and FZO-1 depleted animals display 
fragmented and disorganized mitochondrial network morphology. 
Simultaneous knocking down of fzo-1/dct-1, pink-1/fzo-1 and fzo-1/pdr-1 does 
not alter the already fragmented mitochondrial network of FZO-1-deficient 
worms (n = 120; ***P < 0.001 and NS, P > 0.05; one-way ANOVA). c, DRP-1-deficient
animals present disorganized and aggregated mitochondrial
morphology. Simultaneous knocking down of drp-1/dct-1, pink-1/drp-1 and
pdr-1/drp-1 does not change the impaired mitochondrial network of
DRP-1-depleted animals (n = 100; ***P < 0.001 and NS, P > 0.05; one-way ANOVA).
Error bars, s.e.m. 
Extended Data Figure 4 | Mitophagy is induced under several stress
conditions. a, DCT-1 specifically localizes on the outer mitochondrial 
membrane (shown for muscle cells), where it co-localizes with the 
autophagosomal protein LGG-1/Atg8. Confocal images of body wall muscle 
cells of wild-type animals expressing fluorescently tagged DCT-1::GFP along 
with the autophagosomal marker LGG-1/Atg8 (tagged with DsRed). b, DCT-1 
co-localizes with the E3 ubiquitin ligase PDR-1/Parkin. Confocal images of 
body wall muscle cells in wild-type animals expressing fluorescently tagged 
DCT-1::GFP and PDR-1::DsRed. Scale bar, 20 µm. Images were acquired using
a ×40 objective lens. c, C. elegans transgenic embryos expressing a full-length
plgg-1GFP::LGG-1 fusion protein indicative of autophagic activity were subjected to
single daf-2, dct-1 or pink-1 RNAi, or to simultaneous daf-2/dct-1 and
pink-1/daf-2 RNAi (n = 70; NS, P > 0.05, ***P < 0.001; one-way ANOVA).
d, C. elegans transgenic animals expressing a full-length plgg-1DsRed::LGG-1
fusion protein indicative of autophagic activity were subjected to single daf-2, dct-1,
pink-1 or pdr-1 RNAi, or to simultaneous daf-2/dct-1, pink-1/daf-2 or
daf-2/pdr-1 RNAi. Autophagosome number was monitored in the pharynx of the animals
(n = 100; NS, P > 0.05, ***P < 0.001; one-way ANOVA). e, The daf-2(e1370)
mutant was subjected to RNAi against dct-1, pink-1, pdr-1 or lgg-1, or to
simultaneous lgg-1/dct-1, pink-1/lgg-1 or lgg-1/pdr-1 RNAi. Lifespan
values are given in Supplementary Table 1; assays were performed at 20 °C.

f, Transgenic animals expressing the mtRosella biosensor in body wall muscle
cells were subjected to daf-2(RNAi), treated with paraquat or CCCP, or
exposed to heat stress (37 °C). Mitophagy stimulation is signified by a
reduction in the ratio of pH-sensitive GFP to pH-insensitive DsRed.
DCT-1, PINK-1 and PDR-1 are required for mitophagy. Mitophagy is not
activated under stress conditions when transgenic animals are subjected to
RNAi against dct-1, pink-1 and pdr-1 (n = 120; NS, P > 0.05, ***P < 0.001;
one-way ANOVA). Scale bars, 20 µm. Images were acquired using a ×10
objective lens. g, Mitophagy is not further reduced under oxidative stress 
conditions in transgenic animals subjected to RNAi against dct-1, pdr-1 or 
subjected to simultaneous pdr-1/dct-1 RNAi (NS, P > 0.05, ***P < 0.001; one- 
way ANOVA). h, Mitophagy inhibition under stress conditions in transgenic 
animals subjected to RNAi against lgg-1 (NS, P > 0.05, ***P < 0.001; one-way
ANOVA). Error bars, s.e.m. 
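
The mtRosella readout described in this legend, a fall in the ratio of pH-sensitive GFP to pH-insensitive DsRed signal, can be illustrated with a short sketch. The function names and intensity values are hypothetical and the image quantification itself is not covered; the code only shows how per-animal ratios and their mean and s.e.m. could be computed.

```python
from statistics import mean, stdev


def gfp_dsred_ratios(gfp, dsred):
    """Per-animal ratio of GFP to DsRed pixel intensity."""
    return [g / d for g, d in zip(gfp, dsred)]


def mean_and_sem(values):
    """Mean and standard error of the mean for a list of ratios."""
    sem = stdev(values) / len(values) ** 0.5 if len(values) > 1 else 0.0
    return mean(values), sem


# Hypothetical intensities for three animals per condition.
control = gfp_dsred_ratios([100.0, 90.0, 110.0], [50.0, 45.0, 55.0])
stressed = gfp_dsred_ratios([60.0, 45.0, 50.0], [60.0, 45.0, 50.0])

control_mean, control_sem = mean_and_sem(control)
stressed_mean, stressed_sem = mean_and_sem(stressed)

# A value below 1 indicates a reduced GFP/DsRed ratio, read as mitophagy induction.
relative_ratio = stressed_mean / control_mean
print(round(relative_ratio, 2))
```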
Extended Data Figure 5 | Monitoring mitophagy in vivo. Transgenic
animals co-expressing a mitochondria-tagged GFP (mtGFP) in body wall
muscle cells and the autophagosomal protein LGG-1 fused with DsRed were
exposed to heat stress (37 °C), treated with CCCP or paraquat, or subjected to
RNAi against daf-2. Mitophagy induction is signified by co-localization of GFP
and DsRed signals (for each group of images mitochondria are shown in green
on top, autophagosomes in red below, with a merged image at the bottom).
a, DCT-1 and PINK-1 are required for mitophagy. Mitophagy is inhibited under
stress conditions when transgenic animals are subjected to RNAi against dct-1
and pink-1. Scale bars, 20 µm. Images were acquired using a ×40 objective lens.
b, High-magnification images showing induction of mitophagy. Scale bars,
10 µm. Images were acquired using a ×40 objective lens.
Extended Data Figure 6 | Mitochondrial content is not affected upon
mitochondrial dysfunction. a, Knockdown of either PHB-1, PHB-2, CLK-1 or 
ISP-1 does not affect the mitochondrial mass in transgenic worms that express a 
mitochondria-targeted GFP in the intestine (n = 100; NS, P > 0.05; one-way 
ANOVA). b, c, Simultaneous RNAi against phb-1/dct-1, phb-1/pink-1,
phb-1/pdr-1, phb-2/dct-1, pink-1/phb-2, phb-2/pdr-1, isp-1/dct-1, pink-1/isp-1,
pdr-1/isp-1, clk-1/dct-1, pink-1/clk-1, clk-1/pdr-1 increases mitochondrial mass
(n = 100; ***P < 0.001; one-way ANOVA). d, Knockdown of either DCT-1,
PINK-1 or PDR-1 increases mitochondrial mass in transgenic worms that 
express a mitochondria-targeted GFP in the intestine. Transgenic worms 
expressing mitochondria-tagged GFP in the intestine were raised over one 
generation in the presence of EtBr to block mitochondrial biogenesis.
Knockdown of either DCT-1, PINK-1 or PDR-1 increases mitochondrial 
content of animals exposed to EtBr (n = 100; ***P < 0.001; one-way 
ANOVA). e, Knockdown of either PBS-5 or RPN-6 promotes fragmentation of 
mitochondrial network and decreases mitochondrial content in worms that 
express a mitochondria-targeted GFP in the intestine (n = 100; ***P < 0.001; 
one-way ANOVA). Scale bar, 20 µm. Images were acquired using a ×40
objective lens. f–h, The expression levels of the ges-1, myo-3 and unc-119 genes are
relatively stable throughout ageing. Inhibition of the autophagy or proteasome
system does not alter the expression levels of the ges-1, myo-3 and unc-119 genes
during ageing (NS, P > 0.05, **P < 0.01; one-way ANOVA). Error bars, s.e.m. 
Extended Data Figure 7 | Gene ced-9 acts in the same genetic pathway with
dct-1, pink-1 and pdr-1 to mediate mitophagy. a, CED-9 depletion does not 
affect the localization of DCT-1. Scale bars, 20 µm. Images were acquired using
a ×63 objective lens. b, Knockdown of either CED-9, DCT-1, PINK-1 or
PDR-1 increases mitochondrial mass in transgenic worms expressing a 
mitochondria-targeted GFP in the intestine. Simultaneous RNAi against ced-9/ 
dct-1, pink-1/ced-9 and ced-9/pdr-1 does not increase further mitochondrial 
mass (n = 100; NS, P > 0.05, ***P < 0.001; one-way ANOVA). c, Transgenic 
animals expressing the mtRosella biosensor in body wall muscle cells were
treated with paraquat. Mitophagy induction is signified by a reduction in the
ratio of pH-sensitive GFP to pH-insensitive DsRed. CED-9 is required
for mitophagy. Mitophagy is inhibited under stress conditions when transgenic
animals are subjected to RNAi against ced-9 (n = 120; NS, P > 0.05,
***P < 0.001; one-way ANOVA). Error bars, s.e.m. d, Mitophagy inhibition
under stress, in transgenic animals subjected to RNAi against ced-9. Scale bars,
20 µm. Images were acquired using a ×40 objective lens.
Extended Data Figure 8 | Inhibition of mitophagy affects longevity in a
genetic-background-specific manner. a, Depletion of PDR-1 or PINK-1.
b, c, Depletion of both DCT-1 and PINK-1. d, e, Depletion of both PDR-1 and
PINK-1. f, Depletion of both DCT-1 and PDR-1. g, h, Knockdown of DCT-1
shortens the lifespan of long-lived clk-1(e2519) (g) and eat-2(ad465)
(h) mutants. i, j, Mitohormesis is not engaged in mitophagy-deficient animals
to influence lifespan. Depletion of either DCT-1, PINK-1 or PDR-1, with or
without NAC or BHA treatment. k, Depletion of DCT-1, PINK-1 or PDR-1,
with or without EGTA treatment. l, DCT-1 overexpression does not influence
the lifespan of otherwise wild-type animals or pink-1(tm1779) mutants under
normal conditions. DCT-1 overexpression extends only the maximum lifespan
of DAF-2-depleted worms. However, PINK-1 deficiency abolishes the
extended lifespan of both daf-2(RNAi) and daf-2(RNAi);Ex[pdct-1DCT-1::GFP]
animals. Survival curves depict the percentage of animals remaining alive,
plotted against animal age. Lifespan values are given in Supplementary Table 1;
assays were performed at 20 °C.
Extended Data Figure 9 | Mitophagy deficiency impairs mitochondrial
homeostasis and stress resistance. a, b, Fifteen-day-old dct-1(tm376),
pink-1(tm1779), pdr-1(gk448) and wild-type animals incubated (a) at 37 °C for 7 h
(n = 150; ***P < 0.001; one-way ANOVA) and (b) treated with 8 mM
paraquat (n = 150; ***P < 0.001; one-way ANOVA). c, Transgenic animals
expressing the H2O2 biosensor HyPer subjected to either dct-1 or pink-1 RNAi.
The graph depicts pixel intensity ratios of oxidized to reduced HyPer (n = 70;
**P < 0.01; one-way ANOVA). d, Mitophagy-depleted animals display
increased ratios of mitochondrial ROS formation (MitoTracker Red CM-H2X
ROS) to total mitochondrial content (intestinal mtGFP) (n = 150;
***P < 0.001; one-way ANOVA). e, TMRE staining of mitophagy-depleted
adults at 20 °C or 25 °C (n = 150; *P < 0.05, ***P < 0.001; one-way ANOVA).
f, The percentage of mitochondrial DNA relative to wild type is shown. 
Quantitative PCR was performed in wild-type, dct-1(tm376), pink-1(tm1779) 
and pdr-1(gk448) mutant animals. The percentage of mtDNA is not affected in 
mitophagy-deficient animals (NS, P > 0.05; one-way ANOVA). g, The genetic 
locus of the dct-1 gene is depicted; dct-1(tm376) is a homozygous deletion. The
dct-1(tm376) mutant carries a deletion of 900 base pairs at the promoter region of
the dct-1 gene. h, i, Transgenic animals were generated carrying extra copies of
wild-type dct-1 gene and dct-1 gene carrying deletions for the corresponding 
sequences of WXXL motif, MER domain and BH3 domain, and were exposed 
to heat stress (37 °C) or treated with paraquat. The WXXL motif of DCT-1 is
required for the mitophagy process, since WXXL-deleted DCT-1 fails to rescue
resistance of dct-1(tm376) animals under stress conditions (n = 150; NS,
P > 0.05, ***P < 0.001; one-way ANOVA). j, k, Wild-type, pink-1(tm1779)
and pdr-1(gk448) transgenic animals expressing a full-length pdct-1DCT-1::GFP
reporter fusion were exposed to heat stress (37 °C) or treated with paraquat. 
DCT-1 overexpression is sufficient to promote stress resistance in wild-type 
worms. However, the stress resistance of DCT-1 overexpressing animals is 
abolished in pink-1 and pdr-1 mutant background (n = 150; ***P < 0.001; 
one-way ANOVA). Error bars, s.e.m. l–o, DCT-1 is ubiquitinated in a
PINK-1-dependent manner upon oxidative stress. DCT-1::GFP was
immunoprecipitated from total lysates of wild-type and pink-1(tm1779)
transgenic animals expressing a full-length pdct-1DCT-1::GFP reporter fusion.
The immunoprecipitated material was trypsinized and subjected to nLC-MS/MS
analysis. l, DCT-1 was identified by mass spectrometry with high
confidence and a sequence coverage of ~60% (underlined on DCT-1
sequence). The two most prominent tryptic peptides of DCT-1 (highlighted in 
blue) were used for the relative quantification of the ubiquitination within the 
different samples. Ubiquitin-modified lysine is highlighted in red. m, Numbers 
represent the following samples: untreated wild-type worms (sample 1), 
untreated wild-type transgenic worms expressing a full-length
pdct-1DCT-1::GFP reporter fusion (sample 2), paraquat-treated wild-type transgenic
worms expressing a full-length pdct-1DCT-1::GFP (sample 3) and
paraquat-treated pink-1(tm1779) mutants expressing a full-length pdct-1DCT-1::GFP
(sample 4). The most prominently identified tryptic peptides 19-50 (peptide 1) 
and 101-121 (peptide 2) were used to determine the relative abundance of 
modified and unmodified DCT-1 in the four samples. The maximum 
intensities of the peptide ions of each peptide in the different samples were 
normalized and used for the comparative analysis and K26-GG abundance. 
The unmodified peptides showed similar pattern and abundance in the 
different samples (not identified in control sample 1, which did not express 
DCT-1::GFP, as expected), while the tryptic peptide with the K26-GG
modification (19-50) was significantly enriched in sample 3 where mitophagy 
was induced in the presence of PINK-1. n, o, K26-GG modification on the
tryptic peptide 19-50 of DCT-1 was manually validated on the MS/MS spectra
of the unmodified (n) and modified (o) peptide. Both peptides were identified
by high amino-acid coverage, as is shown on their corresponding assigned
spectra. The intact peptide masses after the loss of H2O and NH3 molecules are
shown in green, and the characteristic b and y fragment ions used for
sequencing the amino-acid residues of the peptides are shown in red and blue,
respectively. The characteristic fragments indicating the K26-GG modification
are marked by blue circles and underlined fragment numbers (o).
Extended Data Figure 10 | SKN-1 regulates the transcription of
mitochondrial genes and modulates mitochondrial homeostasis and 
integrity. a, mRNA level analysis of the SKN-1 target genes gst-4 and gst-10 in 
wild-type animals and mitophagy-deficient animals (**P < 0.01; one-way 
ANOVA). b, Expression of the DAF-16-regulated psod-3GFP transgene in
animals subjected to dct-1 or pink-1 RNAi (n = 70; NS, P > 0.05, ***P < 0.001;
one-way ANOVA). c, Expression of the mitochondrial stress reporter
phsp-60GFP in animals subjected to RNAi against dct-1 or pink-1 (n = 70; NS,
P > 0.05; one-way ANOVA). d, Expression of the ER stress reporter phsp-4GFP
in animals subjected to RNAi against dct-1 or pink-1 (n = 70; NS, P > 0.05;
one-way ANOVA). e, mRNA levels of atp-5, gas-1, hmg-5, tim-17, W09C5.8 and
dct-1 in wild-type and SKN-1-depleted animals (skn-1(zu67) and skn-1(zu135)
animals) (*P < 0.01; one-way ANOVA). f, The percentage of mitochondrial
DNA relative to wild type is shown. The percentage of mtDNA is reduced in
SKN-1-depleted worms (**P < 0.01; one-way ANOVA). g, Knockdown of
SKN-1 alters mitochondrial network morphology in the intestine of young
adult animals. Scale bar, 20 µm. Images were acquired using a ×40 objective
lens. h, mRNA levels of pink-1 and pdr-1 in wild-type animals, DAF-16- and
SKN-1-depleted animals (NS, P > 0.05; one-way ANOVA). i, Mitophagy is not
stimulated under oxidative stress, in transgenic animals expressing mtRosella 
subjected to RNAi against skn-1 (n = 120; NS, P > 0.05, ***P < 0.001; one-way 
ANOVA). j, Monitoring mitophagy under oxidative stress, in transgenic
animals subjected to RNAi against skn-1, skn-1 and dct-1, daf-16, daf-16 and
dct-1 (n = 120; NS, P > 0.05, ***P < 0.001; one-way ANOVA). k, SKN-1 
compensates for inhibition of mitophagy in otherwise wild-type animals. 
Simultaneous depletion of DCT-1 and SKN-1, or PINK-1 and SKN-1. Survival 
curves depict the percentage of animals remaining alive, plotted against animal 
age. Lifespan values are given in Supplementary Table 1; assays were performed 
at 20 °C. l, Transgenic animals co-expressing a mitochondria-tagged GFP
(mtGFP) in body wall muscle cells and the autophagosomal protein LGG-1
fused to DsRed were exposed to heat stress (37 °C) or treated with CCCP.
Induction of mitophagy is indicated by co-localization of GFP and DsRed
(for each group of images, mitochondria are shown in green on top,
autophagosomes in red below, with a merged image at the bottom). Mitophagy
inhibition under stress, in transgenic animals subjected to RNAi against skn-1.
Scale bars, 20 µm. Images were acquired using a ×40 objective lens.

m, Cytoplasmic calcium elevation upon mitophagy depletion is abolished upon 
treatment with EGTA (n = 100; NS, P > 0.05, ***P < 0.001; one-way 
ANOVA). n, Transgenic animals expressing the pgst-4GFP reporter subjected to
RNAi against dct-1, pink-1 and pdr-1, with or without EGTA treatment 

(n = 120; NS, P > 0.05, ***P < 0.001; one-way ANOVA). o, UNC-43 regulates
the transcriptional activity of SKN-1 upon mitophagy depletion (n = 90; 
***P < 0.001; one-way ANOVA). Error bars, s.e.m. 
Coordinated regulation of bidirectional COPI
transport at the Golgi by CDC42 


Seung-Yeol Park, Jia-Shu Yang, Angela B. Schmider, Roy J. Soberman & Victor W. Hsu


The Golgi complex has a central role in the intracellular sorting of
secretory proteins. Anterograde transport through the Golgi has
been explained by the movement of Golgi cisternae, known as
cisternal maturation. Because this explanation is now appreciated
to be incomplete, interest has developed in understanding
tubules that connect the Golgi cisternae. Here we show that the
coat protein I (COPI) complex sorts anterograde cargoes into these 
tubules in human cells. Moreover, the small GTPase CDC42 reg- 
ulates bidirectional Golgi transport by targeting the dual functions 
of COPI in cargo sorting and carrier formation. CDC42 also 
directly imparts membrane curvature to promote COPI tubule 
formation. Our findings further reveal that COPI tubular trans- 
port complements cisternal maturation in explaining how antero- 
grade Golgi transport is achieved, and that bidirectional COPI 
transport is modulated by environmental cues through CDC42. 

In addition to its known role in generating vesicles, we recently
discovered that COPI also generates tubules that connect the Golgi
stacks. However, whether these tubules act passively or actively in
cargo transport has been unclear. Active transport involves coat proteins
binding to cargoes. Thus, we initially examined whether COPI
binds to a temperature-sensitive form of the vesicular stomatitis virus
G protein (hereafter referred to as VSVG), which has been widely used to
track anterograde Golgi transport. We found that coatomer, the core
component of the COPI complex, binds directly to the cytoplasmic
tail of VSVG (Fig. 1a and Extended Data Fig. 1a). Further defining this 
binding, we found that coatomer binds to the membrane-proximal 
region of the VSVG tail (Fig. 1b and Extended Data Fig. 1b), and then 
identified residues within this region crucial for binding by coatomer 
(Fig. 1c and Extended Data Fig. 1a). 

Coatomer has been shown previously to promote the retrograde 
transport of cargoes, such as WBP1 (ref. 11) and the KDEL receptor, 
which involves binding to their carboxy terminus that contains 
di-lysine residues. By contrast, the basic residues in VSVG recognized 
by coatomer are situated away from this end (see Extended Data Fig. 
1a), suggesting a new mode of cargo recognition by coatomer. As 
confirmation, we performed competition studies, and verified that a 
peptide derived from WBP1 (containing the retrograde di-lysine 
motif) cannot compete with the VSVG tail for binding to coatomer 
(Fig. 1d). We also used fluorescence lifetime imaging microscopy 
(FLIM) to confirm that coatomer interacts with VSVG at the Golgi, 
and the interaction requires crucial basic residues in VSVG (Fig. 1e and 
Extended Data Fig. 1c). Subsequently, we found that mutation of these 
basic residues delayed the transport of VSVG from the endoplasmic 
reticulum (ER) to the trans-Golgi (Fig. 1f and Extended Data Fig. 1d), 
but not from the ER to the cis-Golgi (Extended Data Fig. 1e). Thus, 
coatomer promotes the intra-Golgi transport of VSVG through cargo 
binding. 

We next examined another anterograde cargo, the endogenous low- 
density lipoprotein receptor (LDLR). Coatomer also binds directly to 
its cytoplasmic tail (Fig. 2a), and this binding is likewise unaffected by the presence 
of the WBP1 peptide (Fig. 2b). As the LDLR tail also contains basic 
residues away from the C terminus (Extended Data Fig. 2a), we next 
targeted them for mutagenesis and found that binding to the LDLR tail 
by coatomer became reduced (Fig. 2a). To confirm the functional role 
of this binding, we swapped the tails of LDLR and VSVG, and then 
examined the transport of the resulting chimaera (VSVG-LDLR). 
When crucial basic residues in the LDLR tail were mutated, we 
observed delayed transport of VSVG-LDLR from the ER to the 
trans-Golgi (Fig. 2c and Extended Data Fig. 2b), but not from the 
ER to the cis-Golgi (Extended Data Fig. 2c). 

Compared with VSVG, VSVG-LDLR is transported more slowly from 
the ER to the cis-Golgi (Extended Data Figs 1e and 2c), and also 
through the Golgi (Fig. 2d and Extended Data Fig. 2d). Pursuing the 
latter observation, we found that coatomer binds more efficiently to 
the VSVG tail than the LDLR tail, both in vitro (Fig. 2e) and in cells 
(Fig. 2f and Extended Data Fig. 3a). Moreover, mutating critical basic 
residues in either cargo tail results in transport rates becoming similar 
(Fig. 2d and Extended Data Fig. 2d). Furthermore, mutating a residue 
in VSVG that partially reduces its binding by coatomer (His494Ala in 
Fig. 1c) moderately reduces VSVG transport through the Golgi 
(Extended Data Fig. 3b). Thus, the collective results further confirmed 
that binding of anterograde cargoes by coatomer promotes their trans- 
port through the Golgi. 

We then sought to address how the same coat complex can 
promote opposing directions of Golgi transport while also maintaining 
specificity in cargo sorting. As a clue, we noted that CDC42 has been 
found previously to interact with coatomer. Moreover, the 
CDC42(Phe28Leu) point mutation, which promotes rapid GTPase 
cycling, enhances VSVG transport. We initially defined 
more precisely that active CDC42(Phe28Leu) promotes the intra- 
Golgi segment of VSVG transport (Fig. 3a and Extended Data Fig. 
4a), with short interfering RNA (siRNA) against CDC42 (Extended 
Data Fig. 4b) having the opposite effect (Extended Data Fig. 4c). By 
contrast, no effect was seen on transport from the ER to the cis-Golgi 
(Extended Data Fig. 4d, e). We then considered that COPI tubular 
transport at the Golgi requires cytosolic phospholipase A2α (cPLA2α) 
activity. Targeting this activity using siRNA, we found that CDC42 
can no longer promote the anterograde transport of VSVG through 
the Golgi (Fig. 3b and Extended Data Fig. 5a). Next, examining retrograde 
COPI vesicular transport, as tracked through a model cargo 
(VSVG-KDELR), we found that active CDC42(Phe28Leu) inhibits 
this transport (Fig. 3c and Extended Data Fig. 5b), whereas CDC42 
siRNA enhances this transport (Extended Data Fig. 5c). Thus, the 
collective results revealed that CDC42 has opposite effects on the 
two directions of COPI transport at the Golgi, promoting anterograde 
tubular transport and inhibiting retrograde vesicular transport. 

We then sought to determine how CDC42 exerts these effects. We 
had previously found that retrograde cargoes are preferentially sorted 
into COPI vesicles generated through a reconstitution system. 
However, reconstituted COPI tubules contain both anterograde and 

Figure 1 | Coatomer binds VSVG to promote its transport through the 
Golgi. a-c, In vitro binding of the VSVG tail (full-length (a), truncations 
(b), point mutants (c)) by coatomer; n = 3. kDa, kilodaltons. d, Using the 
WBP1 peptide that contains the retrograde di-lysine motif (KK) or a di-serine 
motif (SS) for competition in cargo binding by coatomer; n = 4. e, FLIM 
examining VSVG interacting with coatomer; n = 3. ***P < 0.001 (two-tailed 
Mann-Whitney test). WT, wild type. f, Effect of mutating the VSVG tail on its 
transport from the ER to the trans-Golgi; n = 4. Data are mean ± s.e.m. 


retrograde cargoes. Thus, to explore whether CDC42 acts to specify 
cargo sorting into these tubules, we added CDC42 to the reconstitution 
system. Remarkably, the level of a retrograde cargo (VSVG-KDELR) 
in COPI tubules became reduced (Fig. 3d), while that of an anterograde 
cargo (VSVG) was unaffected (Fig. 3e). In considering how CDC42 
achieves these effects, we noted that CDC42 possesses a di-lysine motif, 
which competes with retrograde cargoes for binding to coatomer. 
Pursuing this clue, we found that active CDC42 prevents coatomer 

Figure 2 | Coatomer also binds to the LDLR tail to promote the transport of 
VSVG-LDLR through the Golgi. a, In vitro binding of the LDLR tail (full-length 
and point mutants) by coatomer; n = 4. b, Using the WBP1 peptides for 
competition in cargo binding by coatomer; n = 3. c, Effect of mutating the 
LDLR tail on the transport of VSVG-LDLR from the ER to the trans-Golgi; n = 
4. d, Effect of mutations in cargo tails on the intra-Golgi transport of different 
cargoes; n = 3. e, Titrating coatomer level for in vitro binding to cargo tails; n = 
2. f, FLIM examining different cargoes interacting with coatomer; n = 3. **P < 
0.01, ***P < 0.001 (two-tailed Mann-Whitney test). Data are mean ± s.e.m. 

from binding to the retrograde cargo (KDELR) tail (Fig. 3f), but not 
coatomer binding to the anterograde cargo (VSVG) tail (Fig. 3g). 
These results were further confirmed by FLIM (Extended Data Fig. 
5d, e). We also examined a range of other small GTPases known to act 
in Golgi transport, and found that none of them affects coatomer 
binding to either the VSVG tail (Extended Data Fig. 6a) or the 
KDELR tail (Extended Data Fig. 6b). Other Rho small GTPases also 
did not have an effect (Extended Data Fig. 6a, b). Thus, the collective 
results suggested that CDC42 imparts specificity in cargo sorting into 
COPI tubules by competing with retrograde, but not anterograde, 
cargoes for binding to coatomer. 

Besides cargo sorting, the other major function of coat proteins is 
the generation of transport carriers. Previously, we found that the 
addition of cPLA2α to the COPI reconstitution system diverts vesicle 
formation to tubule formation. Remarkably, when we replaced 
cPLA2α with the active CDC42(Phe28Leu), COPI tubule formation 
was also promoted (Fig. 4a), and at the expense of vesicle formation 
(Fig. 4b). In considering how CDC42 exerts these effects, we initially 
ruled out that CDC42 acts through cPLA2α, as the replacement of wild-type 
cPLA2α with the catalytically dead form in the COPI reconstitution 
system did not prevent active CDC42 from promoting tubule formation 
(Extended Data Fig. 6c) or from inhibiting vesicle formation 
(Extended Data Fig. 6d). To seek a different explanation, we noted 
that protein factors can directly impart membrane curvature, as 
reflected by their ability to tubulate liposomes. Notably, when we 
incubated active CDC42(Phe28Leu) with liposomes, tubulation was 
observed (Fig. 4c, d). 

To determine how CDC42 exerts this effect, we initially considered 
that CDC42 is prenylated, which suggests how the GTPase can insert 
asymmetrically into the membrane bilayer to impart curvature. As 
confirmation, we found that a non-prenylated form of CDC42 cannot 
induce liposome tubulation (Fig. 4c, d). We then considered that 
CDC42 undergoes dimerization, which suggested an additional possibility 
that it can polymerize onto membrane to exert curvature 
through scaffolding. Performing mutagenesis, we identified three residues 
at the C terminus of CDC42, anchored by di-arginine residues 

Figure 3 | CDC42 modulates cargo sorting by COPI. a, b, Effect of different 
conditions on the intra-Golgi transport of VSVG; n = 5. Ctrl, control; si, siRNA. 
c, Effect of expressing active CDC42 on VSVG-KDELR transport; n = 3. 
d, e, Immunogold labelling of reconstituted COPI tubules, with representative 
(out of 15) electron microscopy images and quantification shown; n = 4. ***P 
< 0.001 (two-tailed Student’s t-test); NS, not significant. Scale bars, 50 nm. Data 
are mean ± s.e.m. f, g, Effect of adding different CDC42 forms on the binding of 
cargo tails by coatomer; n = 2. 

Figure 4 | CDC42 modulates carrier formation by COPI. a, Effect of active 
CDC42 on COPI tubule formation, with representative (out of 15) electron 
microscopy images and quantification shown; n = 4. Scale bars, 50 nm. **P < 
0.01 (two-tailed Student’s t-test). b, Effect of active CDC42 on COPI vesicle 
formation; n = 6. **P < 0.01 (two-tailed Student’s t-test). c, d, Effect of mutations 
in CDC42 on its ability to induce liposome tubulation. c, Representative (out 
of 10) electron microscopy images. Scale bars, 500 nm. d, Quantification of 
data in c; n = 5. ***P < 0.001 (two-tailed Student’s t-test). e, Effect of 
mutations in CDC42 on its ability to induce COPI tubule formation; n = 4. 
**P < 0.01 (two-tailed Student’s t-test). f, Effect of mutations in CDC42 on 
its ability to promote the intra-Golgi transport of VSVG; n = 3. Data are 
mean ± s.e.m. 


(Extended Data Fig. 7a), to be required for dimerization (Extended 
Data Fig. 7b). When the di-arginine residues were mutated, we found 
that active CDC42(Phe28Leu) can no longer tubulate liposomes 
(Fig. 4c, d). As further confirmation, we found that mutating the di- 
arginine residues also prevents CDC42 from promoting tubule forma- 
tion in the COPI reconstitution system (Fig. 4e). Moreover, this muta- 
tion prevents CDC42 from promoting the intra-Golgi transport of 
VSVG (Fig. 4f and Extended Data Fig. 7c). We also considered that 
the di-arginine residues are closely juxtaposed to the di-lysine residues 
in CDC42 (Extended Data Fig. 7a). Thus, we assessed whether the 
di-arginine residues have a role in COPI cargo sorting. However, 
mutation of the di-arginine residues does not prevent activated 
CDC42(Phe28Leu) from interacting with coatomer (Extended Data 
Fig. 7d). Moreover, this mutation also does not alter the ability of active 
CDC42(Phe28Leu) to inhibit coatomer binding to the retrograde 
(KDELR) cargo (Extended Data Fig. 7e). In comparison, mutating 
the di-lysine residues in CDC42 affects this cargo binding by coatomer 
(Extended Data Fig. 7f). These binding results were also further con- 
firmed in cells through FLIM (Extended Data Fig. 7g). 

We next considered that coatomer has been identified as a target 
by which CDC42 promotes cellular transformation. Led by this finding, 
we found that serum induces similar effects on bidirectional Golgi 
transport as activated CDC42 (Extended Data Fig. 8a, b). Moreover, 
epidermal growth factor (EGF) can replace serum in exerting these 
effects (Extended Data Fig. 8c, d). By contrast, neither serum 
(Extended Data Fig. 8e) nor EGF (Extended Data Fig. 8f) affects trans- 
port from the ER to the cis-Golgi. Mechanistically, serum acts through 
CDC42, as the enhanced anterograde transport of VSVG (Extended 
Data Fig. 8a) and the inhibited retrograde transport of VSVG-KDELR 
(Extended Data Fig. 8b) are prevented after CDC42 siRNA. This was 
also the case for EGF stimulation (Extended Data Fig. 8c, d). 

We then considered that the SRC kinase has also been shown to 
promote anterograde Golgi transport. Using siRNA targeting the 
SRC gene, we found that serum can no longer modulate bidirectional 
Golgi transport (Extended Data Fig. 9a, b). We also found that phar- 
macological activation of SRC reproduces the effects of serum 
(Extended Data Fig. 9c, d). Moreover, CDC42 siRNA prevents SRC 
activation from exerting these effects (Extended Data Fig. 9c, d). Thus, 
we concluded that a signalling cascade, which involves growth factors 
activating SRC and then CDC42, modulates bidirectional Golgi transport 
(Extended Data Fig. 10a). Notably, as siRNA against cPLA2α 
prevents SRC activation from promoting VSVG transport 
through the Golgi (Extended Data Fig. 10b), we further concluded that 
the signalling cascade targets COPI tubular transport in promoting 
anterograde Golgi transport. 

In summary, we have advanced a new understanding of how Golgi 
transport is achieved and regulated. Our findings also address a continuing 
controversy about the role of COPI in anterograde Golgi transport. 
Although this debate has focused on the role of COPI vesicular 
transport, we provide a reconciling explanation by showing how 
COPI acts in the anterograde direction through tubular transport. Another notable 
finding is that CDC42 acts not just as a regulator of cellular 
events, but also as an effector, which in the case of COPI transport 
involves a direct ability to impart membrane curvature. Our results 
also provide new insight into how CDC42 targets coatomer for cellular 
transformation. Whereas CDC42 was previously thought to affect 
COPI transport in the retrograde direction, our finding that 
CDC42 modulates bidirectional COPI transport to favour the antero- 
grade direction suggests more readily how transformation is pro- 
moted, as the rapid growth and division of tumours is predicted to 
require enhanced secretion. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS 

Chemicals, lipids, peptides and cells. The SRC family activator (Santa Cruz, 
SC3052) has been described. GDP and GTP were purchased (Sigma-Aldrich). 
DOPC (dioleoylphosphatidylcholine), DOPE (dioleoylphosphatidylethanolamine), 
DOPS (dioleoylphosphatidylserine), phosphatidic acid and cholesterol were 
also purchased (Avanti Polar Lipids). Peptides corresponding to the WBP1 tail (wild-type, 
KKLETFKKTN; mutant, KKLETFSSTN) were synthesized (GenScript). Culturing 
of HeLa cells has been described. 

Antibodies. Mouse anti-β subunit of coatomer, anti-¢ subunit of coatomer, 
mouse anti-Myc (9E10), sheep anti-TGN46 and rabbit anti-giantin antibodies have 
been described previously. Rabbit anti-Myc antibody (2278S), rabbit 
anti-CDC42 antibody (2462S), and mouse anti-β-actin antibody (3700P) were 
obtained (Cell Signaling). All secondary antibodies were also obtained (Jackson 
ImmunoResearch Laboratory). 

Purified and recombinant proteins. Preparation of coatomer, ARF1, ARFGAP1, 
BARS, cPLA2α and Golgi membrane has been described. To generate 
the prenylated form of human CDC42, pFASTBAC THb (Invitrogen) was used 
to generate baculovirus using the Bac-to-Bac Baculovirus Expression system 
(Invitrogen). Infected Sf9 cells were resuspended in hypotonic buffer (20 mM 
sodium borate, pH 10.2, 5 mM MgCl2 and protease inhibitor) followed by lysis 
using a Dounce homogenizer. Membrane-bound CDC42 was collected by centrifugation 
at 150,000g for 30 min. The pellet was then resuspended in TBS-M (50 mM 
Tris (pH 7.2), 150 mM NaCl, 5 mM MgCl2 and protease inhibitor), followed by 
centrifugation at 150,000g for 30 min. Prenylated CDC42 was solubilized from 
membranes using TBS-M containing 1% Triton X-100 for 1 h, followed by 
centrifugation at 9,000g for 20 min. The histidine-tagged CDC42 was then purified 
from the supernatant fraction using a nickel column. Non-prenylated CDC42 was 
generated using bacterial expression. The last three amino acids were deleted to 
mimic the protein sequence of the prenylated form (see Extended Data Fig. 7a). 
Human CDC42 in pET-15b (Novagen) was expressed in bacteria (BL21) with 100 
μM isopropyl-β-D-thiogalactopyranoside (IPTG) induction at 37 °C for 2 h. 
The His-tagged CDC42 was isolated with a nickel column, and then released with 
elution buffer (50 mM Tris (pH 7.2), 500 mM NaCl, 5 mM MgCl2, 0.1% CHAPS 
and 250 mM imidazole). Recombinant forms of RAC1, RhoA, RAB1A, 
RAB2A, RAB6A and RAB33B were generated similarly. 

Plasmids. Cytoplasmic domain of cargoes fused to glutathione S-transferase 
(GST) was generated using in vitro annealing. Paired oligonucleotides were 
annealed in buffer (30 mM HEPES-KOH, pH 7.4, 150 mM KCl, 2 mM MgCl2) 
at 90 °C for 1 min followed by cooling down. Annealed duplexes were inserted into 
pGEX-4T-3 (GE Healthcare). GST-fused cytoplasmic peptides of LDLR were 
generated using PCR. VSVG-LDLR was generated by fusing the cytoplasmic tail 
of LDLR to the luminal and transmembrane domains of VSVG-ts045. Mutations 
were generated using PCR or QuikChange Site-Directed Mutagenesis (Stratagene). 
VSVG-ts045 and VSVG-ts045-KDELR have been described previously. 
CDC42(Phe28Leu) mutant, and the di-lysine mutant of CDC42(Phe28Leu) in 
pFASTBAC THb and PKH3 were gifts from R. A. Cerione. Di-arginine mutant 
of CDC42 was generated using PCR with paired oligonucleotides. Additional 
plasmids were obtained and then subcloned into pET-15b to generate recom- 
binant proteins: RAC1 and RhoA (D. Manor), RAB1A, RAB2A and RAB6A 
(D. Lambright), and RAB33B (M. Fukuda). See Supplementary Information for 
details on the oligonucleotides used to generate chimaeric constructs and point 
mutations. 

Transfections and siRNA. Transfection of DNA plasmids was performed using 
FuGene6 (Roche). Transfection of siRNA was carried out using Lipofectamine 
RNAiMAX (Invitrogen). The siRNA sequence against cPLA2α (targeting the 
PLA2G4A gene) has been described previously. The siRNA sequences used to 
target human CDC42 (5'-GATAACTCACCACTGTCCA-3') and human SRC 
(5'-TGTTCGGAGGCTTCAACTCCT-3') were obtained (Dharmacon). Point 
mutations in CDC42 to render it siRNA resistant were generated using oligonucleotides 
(5'-GAAAAGTGGGTGCCTGAGATAACTCATCACTGTCCAAAGACTCCTTTCTTGCTTGTTGGG-3' 
and 5'-CCCAACAAGCAAGAAAGGAGTCTTTGGACAGTGATGAGTTATCTCAGGCACCCACTTTTC-3'). 
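
The relationship between the siRNA target and the two mutagenic oligonucleotides can be checked with a short script. The sketch below is illustrative only and is not part of the published protocol. The sequences are transcribed from the paragraph above, and any character other than A, C, G or T is dropped as a presumed transcription artifact (an assumption). The helper names are hypothetical. The script asks whether the two oligonucleotides are reverse complements of each other, and where the forward oligonucleotide departs from the siRNA target.

```python
import re


def clean(seq: str) -> str:
    """Upper-case a printed sequence and drop anything that is not A, C, G or T."""
    return re.sub(r"[^ACGT]", "", seq.upper())


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def best_alignment(oligo: str, target: str):
    """Slide target along oligo (no gaps) and return (offset, mismatch positions)
    for the first window with the fewest mismatches."""
    best = None
    for offset in range(len(oligo) - len(target) + 1):
        window = oligo[offset:offset + len(target)]
        diffs = [i for i, (a, b) in enumerate(zip(window, target)) if a != b]
        if best is None or len(diffs) < len(best[1]):
            best = (offset, diffs)
    return best


# Sequences as read from the text (assumption: stray non-nucleotide characters removed).
SIRNA_TARGET = clean("GATAACTCACCACTGTCCA")
FORWARD = clean(
    "GAAAAGTGGG" "TGCCTGAGAT" "AACTCATCAC"
    "TGTCCAAAGA" "CTCCTTTCTT" "GCTTGTTGGG"
)
REVERSE = clean(
    "CCCAACAAGC" "AAGAAAGGAG" "TCTTTGGACA"
    "GTGATGAGTT" "ATCTCAGGCA" "CCCACTTTTC"
)

are_complementary = reverse_complement(FORWARD) == REVERSE
offset, mismatches = best_alignment(FORWARD, SIRNA_TARGET)

print("oligonucleotides are reverse complements:", are_complementary)
print("siRNA site offset in forward oligo:", offset)
print("mismatch positions within the siRNA site:", mismatches)
```

A complementary primer pair that carries a mismatch inside the siRNA site is what one would expect for site-directed mutagenesis intended to confer siRNA resistance, although the text does not state the design rationale.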

In vitro binding assays. GST fusions were expressed in bacteria (BL21 cells) with 
IPTG induction. After lysis in buffer (20 mM Tris, pH 8.0, 100 mM NaCl, 1 mM 
EDTA, 1% Triton X-100, 1 mg ml⁻¹ of lysozyme and protease inhibitor), GST 
peptide was incubated with glutathione Sepharose beads (GE Healthcare). GST 
fusions on beads were then incubated with purified coatomer (2.5 nM) at 4 °C for 1 
h in incubation buffer (25 mM HEPES, pH 7.2, 50 mM KCl, 2.5 mM magnesium 
acetate and 0.5% NP-40). For competition experiments with peptides, GST fusions 
on beads were co-incubated with coatomer and peptide at 4 °C for 1 h in incuba- 
tion buffer. Beads were then rinsed twice with incubation buffer and then analysed 
by SDS-PAGE followed by immunoblotting or Coomassie staining. For competition 
experiments with small GTPases, GTPases were loaded with either GTP or 
GDP and then incubated with cargo (as GST fusions on beads) and coatomer. 
Quantification was performed by analysing the level of coatomer and normalizing 
for the level of cargo on beads. 
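
The normalization described above can be sketched as follows. This is a minimal illustration, not the authors' analysis code. The band intensities are hypothetical values in arbitrary units, and expressing each sample relative to a reference lane is an added assumption, because the text specifies only normalization to the level of cargo on beads.

```python
def normalized_binding(coatomer_signal: float, cargo_signal: float) -> float:
    """Coatomer band intensity per unit of GST-cargo recovered on the beads."""
    if cargo_signal <= 0:
        raise ValueError("cargo signal must be positive to normalize")
    return coatomer_signal / cargo_signal


def relative_binding(samples: dict, reference: str) -> dict:
    """Express each normalized value as a fraction of the reference sample.

    samples maps a lane name to a (coatomer_signal, cargo_signal) pair.
    """
    ref = normalized_binding(*samples[reference])
    if ref == 0:
        raise ValueError("reference lane shows no coatomer binding")
    return {name: normalized_binding(*pair) / ref for name, pair in samples.items()}


# Hypothetical (coatomer, cargo) band intensities in arbitrary units.
example = {"wild type": (80.0, 100.0), "mutant": (20.0, 100.0)}
result = relative_binding(example, "wild type")
print(result)
```

Dividing by the cargo signal corrects for lane-to-lane differences in how much GST fusion was captured on the beads, so that a weaker coatomer band reflects reduced binding and not reduced bait.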

In vivo transport assays. Anterograde transport of VSVG and retrograde transport 
of VSVG-KDELR have been described previously. In brief, HeLa cells were 
transfected with VSVG-ts045 and VSVG-ts045-KDELR to track anterograde and 
retrograde transport, respectively. To track transport from the ER to the Golgi, 
cells were incubated at 40 °C for 2 h to accumulate a synchronized pool of VSVG at 
the ER. Cells were then shifted to incubation at 32 °C to allow transport from the 
ER to proceed. To examine intra-Golgi transport, cells were incubated at 15 °C for 
2 h to accumulate a synchronized pool of VSVG at the pre-Golgi region, followed 
by a shift to 32 °C to allow transport through the Golgi. To study retrograde 
transport, cells expressing VSVG-ts045-KDELR were incubated at 32 °C for 2 h. 
Cells were then incubated at 40 °C. This prevents further anterograde transport 
from the ER, thereby allowing one round of retrograde transport from the 
Golgi to the ER to be assessed. In stimulation experiments, cells were starved using 
DMEM supplemented with 0.1% BSA for 2 h followed by stimulation with serum 
or a SRC activator phosphopeptide (10 μM) for 1 h. Transport was quantified by 
measuring (Metamorph) the colocalization of cargoes with organelle markers over 
time. 

FLIM. Interactions were monitored using time-correlated single-photon 
counting fluorescence lifetime image microscopy analysis (TCSPC-FLIM), 
which has been described previously. In brief, the different forms of 
VSVG were detected through the Myc tag appended at the C terminus using 
an anti-Myc antibody. This antibody was then detected with a secondary anti- 
body conjugated to Alexa Fluor 594 (acceptor fluorophore). Coatomer was 
detected with the anti-coatomer antibody. This antibody was then detected 
with a secondary antibody conjugated to Alexa Fluor 488 (donor fluorophore). 
The baseline lifetimes of the donor fluorophore were calculated by single-expo- 
nential decay fitting of fluorescence emission in the absence of the acceptor 
fluorophore. For samples that involved staining for both donor and acceptor, 
lifetimes were fitted to a bi-exponential decay with lifetime of one component 
fixed to the donor-only lifetime. Three variables for FLIM were determined: (1) 
lifetime for the interacting fraction, τ1, (2) lifetime for the non-interacting 
fraction, τ2, and (3) the percentage of interacting molecules, a1 (%). Because 
VSVG is transported from the ER to the Golgi, we used this mobility to define 
key physiological parameters in quantifying the interaction between VSVG and 
coatomer (based on the τ1 values). The maximal interaction was defined by 
quantifying VSVG interacting with coatomer at the Golgi, and minimal inter- 
action was defined by performing quantitation when VSVG is at the ER. 
Interactions of the different forms of VSVG with coatomer were then expressed 
as the fraction of the maximal interaction (coatomer interacting with wild-type 
VSVG at the Golgi). 
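
The fitting scheme described above corresponds to the standard single- and bi-exponential decay models. They are written out here as a sketch in generic notation, not as the authors' exact fitting equations. The symbols $I_0$ (initial intensity) and $\tau_D$ (donor-only lifetime) are introduced for illustration, and $a_1$ is written as a fraction between 0 and 1 where the text reports it as a percentage.

```latex
% Donor only (no acceptor): single-exponential fit gives the baseline lifetime \tau_D
I_D(t) = I_0 \, e^{-t/\tau_D}

% Donor plus acceptor: bi-exponential fit with one component fixed to \tau_D
I_{DA}(t) = I_0 \left[ a_1 \, e^{-t/\tau_1} + (1 - a_1) \, e^{-t/\tau_2} \right],
\qquad \tau_2 = \tau_D \ \text{(fixed)}, \quad 0 \le a_1 \le 1
```

With $\tau_2$ held at the donor-only value, the fit has two free parameters of interest: the shortened lifetime $\tau_1$ of donors undergoing energy transfer and the fraction $a_1$ of donors in that state.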

COPI reconstitution system. The two-stage incubation system was performed as 
described previously. In brief, Golgi membrane (250 μg) was washed with 100 μl 
of 3 M KCl for 5 min, and then diluted with 1 ml of reaction buffer (25 mM 
HEPES-KOH, pH 7.2, 50 mM KCl, 2.5 mM magnesium acetate, 1 mg ml⁻¹ 
soybean trypsin inhibitor, 1 mg ml⁻¹ BSA and 200 mM sucrose). In the first stage, 
the washed membrane was incubated with ARF1, coatomer and GTP at 37 °C for 
10 min. The reaction was stopped in ice water for 5 min, followed by centrifugation 
at 12,000g for 20 min. In the second stage, the pellet was resuspended in 
reaction buffer, and then incubated with ARFGAP1 and BARS for 20 min to 
generate COPI vesicles, or additionally with CDC42 and/or cPLA2α to generate 
COPI tubules. 

Liposome tubulation assay. Liposomes were generated using a mixture of pure 
lipids (Avanti Polar Lipids): DOPC (33%), DOPS (5%), DOPE (15%), phosphatidic 
acid (30%) and cholesterol (18%). Lipids (500 μg) in glass tubes were dried 
under nitrogen gas. Lipids were resuspended with assay buffer (25 mM 
HEPES, pH 7.2, 50 mM KCl, 2.5 mM magnesium acetate and 200 mM sucrose) 
and hydrated overnight. Liposomes were then passed through a 400-nm filter membrane 
(Whatman) in a mini-extruder (Avanti Polar Lipids). Recombinant CDC42 
was then incubated with liposomes at room temperature for 30 min, followed by 
electron microscopy analysis. 
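
As a worked example of the lipid mixture, the sketch below converts the stated percentages into per-lipid amounts for a 500 μg batch. It assumes the percentages are by mass, which the text does not specify. Because the percentages as printed sum to 101, the sketch normalizes by their actual total instead of by 100. The function and variable names are hypothetical.

```python
# Composition as printed in the text (percent of total lipid).
COMPOSITION_PERCENT = {
    "DOPC": 33,
    "DOPS": 5,
    "DOPE": 15,
    "phosphatidic acid": 30,
    "cholesterol": 18,
}


def lipid_masses_ug(total_ug: float, composition: dict) -> dict:
    """Split a total lipid mass across components in proportion to their percentages.

    Percentages are renormalized by their sum, so the parts always add up to total_ug
    even when the printed values do not sum to exactly 100.
    """
    total_percent = sum(composition.values())
    if total_percent <= 0:
        raise ValueError("composition must contain positive percentages")
    return {name: total_ug * pct / total_percent for name, pct in composition.items()}


masses = lipid_masses_ug(500.0, COMPOSITION_PERCENT)
for name, mass in masses.items():
    print(f"{name}: {mass:.1f} ug")
```

If the percentages are instead molar, each component would first need to be weighted by its molecular mass, which this sketch does not attempt.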

Electron microscopy analysis. Electron microscopy analysis of Golgi tubules has 
been described previously. In brief, membrane samples were loaded on electron 
microscopy grids and then fixed with 2% paraformaldehyde (PFA) in PBS for 10 
min. After blocking with 1% BSA in PBS for 10 min, grids were rinsed with water, 
followed by uranyl acetate staining. Golgi tubules were detected using a JEOL 
1200EX transmission electron microscope. To detect cargoes in the reconstituted 
tubules, immunogold labelling was performed, which involved incubation 
with mouse anti-Myc antibody (9E10) for 1 h to detect the Myc-tagged form of 
either VSVG or VSVG-KDELR. After rinsing with PBS, the sample was incubated 
with a rabbit anti-mouse antibody for 30 min. After an additional rinse 
with PBS, the sample was further incubated with protein A (conjugated with 10 
nm gold particles). Grids were then washed with water, followed by uranyl 
acetate staining, and then examination by electron microscopy. Fifty electron 
microscopy meshes were examined for quantification in each condition of an 
experiment. 

Gel filtration chromatography. Dimerization of CDC42 was determined using 
Superdex 200 10/300 GL (GE Healthcare), essentially as previously described. 
Recombinant CDC42 was applied at a flow rate of 0.5 ml min⁻¹ and monitored by 
an ultraviolet detector. 

3. Data are mean ± s.e.m. Scale bars, 5 μm. 

Extended Data Figure 4 | Characterizing how CDC42 affects VSVG 
transport. a, Effect of expressing active CDC42 on the intra-Golgi transport of 
VSVG. Colocalization is exemplified by line scanning across representative 
images (out of 5). b, Immunoblotting of whole-cell lysates to assess efficiency of 
siRNA treatment, n = 3. c, Effect of siRNA against CDC42 on the intra-Golgi 
representative images (out of 5). Quantification is also shown, n = 4. d, Effect of 
expressing active CDC42 on VSVG transport from the ER to the cis-Golgi. 
Representative images of colocalization are shown (out of 5). Quantification is 
also shown, n = 5. e, Effect of siRNA against CDC42 on VSVG transport from 
the ER to the cis-Golgi. Representative images of colocalization are shown (out of 
5). Quantification is also shown, n = 4. Data are mean ± s.e.m. Scale bars, 5 μm. 

Extended Data Figure 5 | Further characterizing the effects of CDC42. 
a, Effect of various conditions on the intra-Golgi transport of VSVG. 
Colocalization is exemplified by line scanning across representative images (out 
of 5). b, Effect of expressing active CDC42 on the transport of VSVG-KDELR. 
Representative images of colocalization are shown (out of 5). c, Effect of siRNA 
against CDC42 on the transport of VSVG-KDELR. Representative images of 
colocalization are shown (out of 5). Quantification is also shown, n = 3. 
d, e, FLIM assessing the effect of active CDC42 on the interaction between 
different VSVG forms and coatomer. Representative images that are pseudo-coloured 
based on τ1 values are shown (out of 10). Quantification is also shown 
as table and as graph, n = 3. ***P < 0.001 (two-tailed Mann-Whitney test). 
Data are mean ± s.e.m. Scale bars, 5 μm. 

Extended Data Figure 6 | Characterizing how CDC42 affects COPI cargo 
sorting and carrier formation. a, b, Effect of different small GTPases on the in 
vitro binding of cargo tails by coatomer. Representative blots are shown (out of 
3). Quantification is also shown, n = 3, mean ± s.e.m. *P < 0.05 (two-tailed 
Student’s t-test). c, d, Effect of different active forms of CDC42 on tubule and 
vesicle formation in the COPI reconstitution system, n = 4, mean ± s.e.m. 
**P < 0.01 (two-tailed Student’s t-test). 

Extended Data Figure 7 | Delineating the role of di-arginine residues in 
CDC42. a, Amino acid sequence at the C terminus of CDC42. The last three 
residues are cleaved after prenylation. b, The di-arginine residues in CDC42 are 
required for dimerization. Representative result from gel filtration (out of 2) is 
shown. c, Effect of different conditions on the intra-Golgi transport of VSVG. 
Colocalization is exemplified by line scanning across representative images (out 
of 5). Scale bar, 5 μm. d, Effect of mutations in CDC42 on its ability to bind 
coatomer in vitro. Representative blot (out of 2) is shown. e, Effect of mutations 
in CDC42 on its ability to compete with cargo tails for binding to coatomer in 
vitro. Representative blot (out of 3) is shown. f, Ability of different forms of 
CDC42 to compete with the KDELR tail for binding to coatomer in vitro. 
Representative blot (out of 2) is shown. g, FLIM examining VSVG-KDELR 
interacting with coatomer, n = 3. Data are mean ± s.e.m. ***P < 0.001 
(two-tailed Mann-Whitney test). 

Extended Data Figure 8 | Delineating how external stimuli regulate 
bidirectional Golgi transport. a, Effect of different conditions on the intra-Golgi 
transport of VSVG. Colocalization is exemplified by line scanning across 
representative images (out of 5). Scale bar, 5 μm. Quantification is also shown, 

Extended Data Figure 9 | Delineating how SRC regulates bidirectional 
Golgi transport. a, c, Effect of different conditions on the intra-Golgi transport 
of VSVG. Colocalization is exemplified by line scanning across representative 

Extended Data Figure 10 | Further characterizing how bidirectional COPI 

[Figure residue: line-scan plots of VSVG colocalization at 30 min; axes are intensity (%) versus distance (pixels).] 

©2015 Macmillan Publishers Limited. All rights reserved 


[Figure panel: time course against Time (min), 0 to 30; series: Starvation, SRC activation, SRC activation/si cPLA2α.] 
exemplified by line scanning across representative images (out of 5). Scale bar, 
5 μm. Quantification is also shown, n = 3. Data are mean ± s.e.m. 
Defining fundamental steps in the assembly of the 
Drosophila RNAi enzyme complex 


Shintaro Iwasaki, Hiroshi M. Sasaki, Yuriko Sakaguchi, Tsutomu Suzuki, Hisashi Tadakuma & Yukihide Tomari 


Small RNAs such as small interfering RNAs (siRNAs) and microRNAs 
(miRNAs) silence the expression of their complementary target 
messenger RNAs'” via the formation of effector RNA-induced 
silencing complexes (RISCs), which contain Argonaute (Ago) 
family proteins at their core. Although loading of siRNA duplexes 
into Drosophila Ago2 requires the Dicer-2-R2D2 heterodimer** and 
the Hsc70/Hsp90 (Hsp90 also known as Hsp83) chaperone 
machinery*®*, the details of RISC assembly remain unclear. Here 
we reconstitute RISC assembly using only Ago2, Dicer-2, R2D2, 
Hsc70, Hsp90, Hop, Droj2 (an Hsp40 homologue) and p23. By 
following the assembly of single RISC molecules, we find that, in 
the absence of the chaperone machinery, an siRNA bound to Dicer- 
2-R2D2 associates with Ago2 only transiently. The chaperone 
machinery extends the dwell time of the Dicer-2-R2D2-siRNA 
complex on Ago2, in a manner dependent on recognition of the 
5’-phosphate on the siRNA guide strand. We propose that the 
chaperone machinery supports a productive state of Ago2, allowing 
it to load siRNA duplexes from Dicer-2-R2D2 and thereby 
assemble RISC. 

Because both miRNAs and siRNAs are generated as double-stranded 
RNAs—miRNA/miRNA* duplexes and siRNA duplexes—but mature 
RISCs contain only single-stranded guide RNAs, RISC assembly involves 
at least two steps: loading of a small RNA duplex into Ago (forming pre- 
RISC) and ejection of the passenger strand from Ago (forming mature 
RISC)'”. Even when initiated by ‘pre-diced’ siRNA duplexes rather 
than long double-stranded RNAs, assembly of Drosophila Ago2-RISC 
requires Dicer-2, an RNase III enzyme that processes long double- 
stranded RNAs into siRNA duplexes, and its partner protein R2D2 
(refs 3-5). In contrast, Dicer or its partner proteins are not essential 
for assembly of fly Ago1-RISC’ and mouse and human AGO2- 
RISC’**. Regardless of Dicer dependence, loading of small RNA 
duplexes into Ago requires ATP*”** and is blocked by Hsp70 or 
Hsp90 inhibitors**"*. On the other hand, subsequent ejection of the 
passenger strand from Ago proceeds without ATP or chaperones”””®, 
but is accelerated by cleavage of the passenger strand by Ago’”" and 
the endonuclease C3PO”*””. 

The Hsc70 and Hsp90 chaperones generally act together with other 
multiple proteins, which as a large complex help the client proteins achieve 
an active conformation’’. For example, maturation of mammalian 
steroid hormone receptors to an active ligand-binding form requires 
HSC70, HSP90, HOP, HSP40 and p23 (ref. 21). Given that Ago2 
ligands are siRNA duplexes, RISC assembly is viewed analogously to 
steroid hormone receptor maturation. Thus, we asked whether RISC 
assembly could be reconstituted using fly Ago2, Dicer-2-R2D2 hetero- 
dimer and recombinant chaperone factors. As an in vitro assay system, 
we immunopurified Flag-Ago2 from Drosophila S2 cells (Extended 
Data Fig. 1a), and then incubated the immunoprecipitate with an 
siRNA duplex containing a ³²P-radiolabelled guide strand. On its 
own, immunopurified Ago2 failed to bind siRNA (Extended Data 
Fig. 1b); when supplemented with lysate from ago2⁴¹⁴ mutant fly 
embryos (Extended Data Fig. 1c), Ago2 loaded siRNA duplex and 
ejected the passenger strand, forming mature Ago2-RISC containing 
single-stranded guide (Extended Data Fig. 1b). In contrast, mutant 
Ago2 bearing a Y897E substitution in the pocket that recognizes the 
phosphate and the base of the guide 5’ nucleotide” failed to load siRNA 
even in the presence of lysate (Extended Data Fig. 1b, d). A D965A 
catalytic mutant of Ago2 that cannot cleave the passenger strand’? 
loaded siRNA duplex but failed to mature into Ago2-RISC 
(Extended Data Fig. 1b, d). As expected, duplex loading was blocked 
by chaperone inhibitors (Extended Data Fig. 1e and Supplementary 
Discussion). 

To reconstitute RISC assembly, we first purified the three core com- 
ponents of the chaperone machinery—Hsc70-4, Hsp83 and Hop—using 
affinity-tagged Hop”* (Extended Data Fig. 1a). Although Ago2-RISC 
assembly was undetectable with Dicer-2-R2D2 alone, supplementing 
Dicer-2-R2D2 with Hsc70-4, Hsp83 and Hop supported formation of 
pre- and mature Ago2-RISC (Fig. 1a and Extended Data Fig. 2a). Ago2- 
RISC assembly was further augmented by addition of recombinant 
Droj2 or p23 (Fig. 1a and Extended Data Figs 1a, 2a). To dissect the 
contribution of each of the seven proteins to the reaction, we individually 
purified recombinant Hsc70-4, Hsp83 and Hop (Extended Data 
Fig. 1a). Together, all seven proteins—Dicer-2-R2D2, Hsp83, Hsc70-4, 
Hop, Droj2 and p23—directed efficient Ago2—RISC assembly (Fig. 1b 
and Extended Data Fig. 2b). Omitting any of the five chaperone factors 
reduced the efficiency of RISC assembly by ~70-90% (Fig. 1b and 
Extended Data Fig. 2b). Reconstituted Ago2—RISC was as fully active 
for target cleavage as the Ago2-RISC programmed in lysate (Fig. 1c 
and Extended Data Fig. 2c-f). Thus, Dicer-2-R2D2 plus five chaper- 
one proteins suffice to assemble siRNA duplexes into Ago2-RISC. In 
contrast to canonical duplex loading, loading of a single-stranded RNA 
into Ago2 bypassed the requirement of Dicer-2-R2D2 or the chaper- 
ones (Extended Data Fig. 2g, h). 

Although it was reported that complementary target RNAs can pro- 
mote release of guide strands”, RISC is extremely stable in general. In 
fact, recombinant Ago proteins always co-purify with endogenous RNAs 
from the host cells*°, and our fly Ago2 preparation is largely preoccu- 
pied with endogenous siRNAs. If the chaperone machinery facilitates 
duplex loading, it may do so by releasing preloaded RNAs to increase 
the pool of empty Ago. However, loading of exogenous let-7 siRNA 
duplex into fly Ago2 in the reconstituted system was not accompanied 
by the removal of the endogenous esi-2.1 (also known as esi-2) RNA 
(Fig. 1d and Extended Data Fig. 3a) or the in vitro preloaded CXCR4 
siRNA (Extended Data Fig. 3b-d). We conclude that the chaperones 
support an RNA-free form of Ago2 to load siRNA duplexes without 
emptying preassembled Ago2-RISC. 

In the fly Ago2—RISC assembly pathway, Dicer-2-R2D2 senses the 
siRNA duplex thermodynamic asymmetry, thereby fixing the orienta- 
tion of the duplex transfer to Ago2 (ref. 27). Accordingly, the strand with 
Figure 1 | In vitro reconstitution of Ago2-RISC assembly. a, Pull-down 
assay using the three core chaperone components and recombinant Dicer-2- 
R2D2, Droj2 and p23. b, Pull-down assay with the seven purified recombinant 
proteins. c, Target cleavage after 30-min reaction in the reconstituted 
system. Data represent mean ± standard deviation (s.d.) (n = 3). d, Loading 
of exogenous let-7 siRNA does not accompany unloading of endogenous esi-2.1 
RNA. RNAs in Ago2 were detected by northern blotting. e, The reconstituted 
system faithfully recapitulates the siRNA loading asymmetry. Ago2-RISC 
was assembled with an siRNA with either the guide or the passenger strand 
radiolabelled. f, ATP hydrolysis by both Hsc70-4 and Hsp83 is required for 
maximum Ago2-RISC assembly. WT, wild type. 


the less thermodynamically stable 5’ end is predominantly selected 
as the guide strand'*. When RISC assembly was initiated in the recon- 
stituted system using a functionally asymmetric siRNA duplex with 
either the guide strand or the passenger strand radiolabelled, both 
strands were detected in pre-Ago2-RISC, but only the guide strand 
was detected in mature Ago2-RISC (Fig. 1e and Extended Data Fig. 4a). 
Thus, the reconstituted system faithfully recapitulates siRNA loading 
asymmetry. 

ATP dependence is a hallmark of RISC assembly initiated by small 
RNA duplexes*””"». In the seven-protein reconstituted system, the non- 
hydrolysable ATP analogues ATP-γS, AMP-PNP and ADP all inhibited 
duplex loading (Extended Data Fig. 4b, c). Two of the five chaperone 
factors are ATPases (Supplementary Discussion), so we tested the ATPase- 
defective mutants Hsc70-4(D206S)”** and Hsp83(E35A)~’. Both mutant 
proteins reduced duplex loading efficiency in the reconstituted system, 
suggesting that ATP hydrolysis by both Hsc70-4 and Hsp83 is required 
for maximum RISC assembly (Fig. 1f and Extended Data Fig. 4d). 

Because the biochemical approach cannot detect intermediate states 
during the handover of the siRNA duplex from Dicer-2—R2D2 to Ago2, 
we performed single-molecule imaging of RISC assembly using total 
internal reflection fluorescence (TIRF) microscopy to dissect how the 
chaperone machinery assists duplex loading (Fig. 2a and Extended Data 
Fig. 5)°°. We expressed amino-terminal Halo-TEV-Flag-tagged fly Ago2, 
in which a cleavage site for tobacco etch virus (TEV) protease was inserted 
between the Halo and Flag tags, in S2 cells (Extended Data Fig. 6a). The 
Halo-TEV-Flag-tagged Ago2 was captured on a NeutrAvidin-derivatized 
quartz glass surface via Halo-ligand-conjugated biotin. After washing of 
the surface, we added the reconstituted system or equimolar glutathione 
S-transferase (GST) as a control, together with an siRNA with a 3’ 
Alexa647 (red)-labelled guide strand and a 3’ Alexa555 (green)-labelled 
passenger strand (Fig. 2a). The siRNA duplex with these 3’-fluor modifications 
efficiently formed pre- and mature Ago2-RISC in vitro (Extended 
Data Fig. 6b). 
Figure 2 | Single-molecule observation of pre- and mature RISC formation. 
a, Schematic representation of the single-molecule analysis. b, c, Representative 
images (b) and quantification (c) of wash-resistant spots after 1 h incubation. 
Yellow, double-stranded signals; red, guide-alone signals; green, passenger- 
alone and background signals. Mock, no Ago2 tethered. Cpn, chaperone 
machinery; D/R, Dicer-2-R2D2; WT, wild type; Cmut, catalytic mutant 
(D965A). Scale bar, 2 μm. Data represent mean ± s.d. (WT, +D/R, +Cpn, 
n = 6; others, n = 3). 


When the surface-tethered Ago2 and the seven-protein reconstituted 
system were present, many fluorescent spots were observed (Extended 
Data Fig. 6c); after 1 h incubation, the surface was washed and the 
remaining spots were analysed (Extended Data Fig. 5a and Supplemen- 
tary Discussion). Two subpopulations were seen for guide-containing 
spots: one with the guide strand alone (red spots) and the other with both 
guide and passenger strands (colocalization of red and green signals, 
giving yellow spots) (Fig. 2b, c). These spots were observed only in the 
presence of Dicer-2-R2D2, the chaperone machinery and the surface- 
tethered Ago2; when Ago2 was released from the glass surface by TEV- 
protease-mediated cleavage (Extended Data Figs 5b and 6d) few yellow 
and red spots remained (Extended Data Fig. 6e, f). Moreover, the cat- 
alytic mutant Ago2, which is defective in strand separation (Extended 
Data Fig. 1b), predominantly produced yellow spots (Fig. 2b, c). These 
data suggest that red and yellow wash-resistant spots represent mature 
RISC and pre-RISC, respectively. In contrast, the number of green-only 
spots was similar in any condition, suggesting that many of them are 
background signals that are commonly observed with green laser exci- 
tation (Fig. 2c). Supporting this idea, inverting the strand colours (that 
is, green-labelled guide and red-labelled passenger) resulted in accu- 
mulation of green spots in wild-type Ago2 and yellow spots in catalytic 
mutant Ago2 (Extended Data Fig. 6g, h) with few red spots. Our single- 
molecule assay, like the pull-down assay, thus faithfully recapitulates 
canonical RISC assembly. 
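
The colour assignments above amount to a small decision rule. The following Python sketch is illustrative only: the function name `classify_spot` and its boolean inputs are hypothetical and are not part of the published analysis, and it assumes the standard labelling (red guide, green passenger) rather than the inverted-colour control.

```python
def classify_spot(red: bool, green: bool) -> str:
    """Interpret a wash-resistant spot from its two fluorescence channels.

    Assumes a red (Alexa647) guide strand and a green (Alexa555) passenger strand.
    """
    if red and green:
        return "pre-RISC"  # yellow: both strands still bound to Ago2
    if red:
        return "mature RISC"  # guide alone: passenger strand has been ejected
    if green:
        return "passenger-alone or background"  # seen at similar levels in every condition
    return "no spot"


for channels in [(True, True), (True, False), (False, True)]:
    print(channels, "->", classify_spot(*channels))
```

With the inverted labelling, the roles of the two channels would simply be swapped.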

Next, we continuously monitored the appearance of the spots during 
the first 20 min without washing (Extended Data Fig. 5c and Sup- 
plementary Discussion). Given that the siRNA duplex is loaded into 
Ago2 as double strands, we focused on the events where the red and 
green spots appeared at the same time. In many cases, the red and green 
spots disappeared simultaneously within the 20-min observation window 
(Fig. 3a, top; ~86%). In some cases, the two colours remained present 
until the end of the observation period (Fig. 3a, middle; ~9%), the green 
spots disappeared earlier than the red spots (Fig. 3a, bottom; ~3%), or 
vice versa (data not shown; ~1%). Interestingly, the number of total 
binding events was comparable in the presence or absence of the chap- 
erone machinery, but required the presence of Dicer-2-R2D2 and 
surface-tethered Ago2 (Fig. 3b). Moreover, the distribution of the spot 
appearance duration showed a similar single-exponential function with 
a time constant (τ) of ~20 s, regardless of the presence of the chaperone 
machinery (Fig. 3c and Extended Data Fig. 7a). Thus, the chaperone 
Figure 3 | The chaperone machinery extends the dwell time of Dicer-2- 
R2D2-siRNA on Ago2. a, Representative traces that start with simultaneous 
binding of red and green spots during the 20-min monitoring period. AU, 
arbitrary units. b, Total event numbers during the continuous monitoring. 
Dicer-2-R2D2 (D/R), but not the chaperone machinery (Cpn), was required 
for the spots to appear. Wild type (WT)+TEV, Ago2 was detached from the 
glass surface before monitoring (Extended Data Fig. 5c). c, Dwell time 
histograms. The chaperone machinery was dispensable for the transient 
binding (~20 s). Solid line, fitted from the cumulative plot (Extended Data 
Fig. 7a). d, Numbers of long binding events (>100 s). Yellow, simultaneous 
dissociation of red and green spots or their continuous binding to the end of 
observation; red, dissociation of the green spots with the red spots remaining 
(yellow-to-red); green, dissociation of the red spots with the green spots 
remaining (yellow-to-green). WT +TEV, detachment of Ago2 before 
monitoring. e-h, Dwell-time distribution for the yellow-to-red or yellow-to- 
green transitions in wild-type (e, f) and catalytic mutant Ago2 (g, h). The 
yellow-to-red duration in wild-type Ago2 (e) fits to a two-state reaction model 
and differs significantly from those of the other data (f-h, P = 0.00025, 0.024, 
and 0.017, respectively; two-sample Anderson-Darling test), which showed 
single-exponential decays. Solid line, fitted from the cumulative plot (Extended 
Data Fig. 7b-e). Data in b, d represent mean ± s.d. (WT, +D/R, +Cpn, n = 5; 
others, n = 3). 


machinery is dispensable for the short dwelling of Dicer-2-R2D2-bound 
siRNA on Ago2. In contrast, long binding events (>100 s) require Ago2, 
Dicer-2-R2D2 and the chaperone machinery (Fig. 3d). When all factors 
were present, ~18% of long binding events showed the disappearance 
of the green spots before the red spots, which presumably reflects the 
formation of mature RISC via ejection of the passenger strand from 
Ago2 (Fig. 3d, red). 
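
To see why a 100 s cut-off separates the two populations, it may help to ask how often a purely transient binding event would last that long. The sketch below assumes, for illustration only, that transient dwell times follow an ideal single-exponential distribution with τ = 20 s; the function name `fraction_longer_than` is hypothetical.

```python
import math


def fraction_longer_than(t_s: float, tau_s: float) -> float:
    """Survival function of a single-exponential dwell time: P(T > t) = exp(-t / tau)."""
    return math.exp(-t_s / tau_s)


tau = 20.0  # approximate time constant of transient binding, in seconds
threshold = 100.0  # cut-off used for 'long' binding events, in seconds

p_long = fraction_longer_than(threshold, tau)
print(f"P(dwell > {threshold:.0f} s | tau = {tau:.0f} s) = {p_long:.4f}")
```

Under this assumption, fewer than 1% of transient events would exceed 100 s, so events beyond the cut-off are unlikely to be the tail of the ~20 s population.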

To determine the time constant for RISC maturation, we further 
analysed the spots that displayed the yellow-to-red transition. The dis- 
tribution of the dwell time before the disappearance of the green spots 
showed a clear peak and fitted well to a two-step model with time constants 
of ~197 s and ~194 s (Fig. 3e and Extended Data Fig. 7b), indicating 
the involvement of two or more consecutive first-order reactions. 
This is in line with the model that passenger-strand ejection occurs only 
after the initial, transient binding between Dicer-2-R2D2-siRNA and 
Ago2, extension of the dwell time by the chaperone machinery, and the 
cleavage of the passenger strand. In contrast, the spots that displayed 
the opposite, yellow-to-green transition (Fig. 3d, green), showed a single- 
exponential decay of the duration (Fig. 3f and Extended Data Fig. 7c), 
presumably reflecting the photobleaching of the red fluorophore, rather 
than ‘flipped’ RISC maturation. Moreover, in the catalytic mutant 
Ago2, which should retain the guide and passenger strands, both red 
and green spots showed single-exponential decays (Fig. 3g, h and 
Extended Data Fig. 7d, e). We conclude that the chaperone machinery 
acts to extend the dwell time of the Dicer-2-R2D2-siRNA complex on 
Ago2, thereby facilitating the siRNA handover from Dicer-2-R2D2 to 
Ago2 that leads to RISC maturation. 
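
The contrast between a peaked dwell-time distribution and a single-exponential decay can be illustrated with the textbook density for two consecutive first-order steps. This is a minimal sketch under that assumption and is not the fitting procedure used for Fig. 3e; the function names are hypothetical and only the two time constants are taken from the text.

```python
import math


def two_step_pdf(t: float, tau1: float, tau2: float) -> float:
    """Dwell-time density for two consecutive first-order steps (requires tau1 != tau2).

    f(t) = (exp(-t / tau1) - exp(-t / tau2)) / (tau1 - tau2)
    """
    return (math.exp(-t / tau1) - math.exp(-t / tau2)) / (tau1 - tau2)


def two_step_mode(tau1: float, tau2: float) -> float:
    """Time at which the two-step density peaks, obtained by setting df/dt = 0."""
    return tau1 * tau2 * math.log(tau1 / tau2) / (tau1 - tau2)


tau1, tau2 = 197.0, 194.0  # time constants reported for the yellow-to-red transition, in seconds
mode = two_step_mode(tau1, tau2)
mean = tau1 + tau2  # the mean of a sum of two exponential waiting times

print(f"density at t = 0: {two_step_pdf(0.0, tau1, tau2):.3g}")
print(f"peak position: ~{mode:.0f} s; mean dwell: ~{mean:.0f} s")
```

The density is zero at t = 0 and rises to a maximum between the two time constants, whereas a single-exponential density is largest at t = 0. That difference in shape is what distinguishes the yellow-to-red events in wild-type Ago2 from the other transitions.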
The 5’-phosphate on the guide strand is thought to be crucial for Ago2 
to anchor the guide strand via the 5’ nucleotide-binding pocket*>”*. 
Indeed, a 5’-hydroxyl group on the guide strand, which did not affect 
siRNA binding to Dicer-2-R2D2 (Supplementary Discussion), blocked 
formation of pre- and mature Ago2-RISC without altering the func- 
tional asymmetry of the two strands (Extended Data Fig. 8a—c). This 
was also confirmed by the single-molecule analysis; we observed few 
wash-resistant spots when the fluorescently labelled siRNA duplex bore 
a 5’-hydroxyl group on the guide strand or when the 5’ nucleotide- 
binding pocket of Ago2 was mutated (Fig. 4a, b). Intriguingly, however, 
continuous monitoring without washing showed that the number of 
total events for the 5’-hydroxyl siRNA or the 5’ nucleotide-binding 
pocket mutant of Ago2 was comparable with that for the 5’-phosphate 
siRNA and wild-type Ago2 (Fig. 4c). The distribution of the dwell time 
fitted to similar single exponential curves with a time constant of ~20 s, 
regardless of recognition of the 5'-phosphate (Figs 3c, 4d and Extended 
Data Figs 7a, 8d). In contrast, the long binding events that lead to RISC 
maturation were much less frequent in the absence of 5’-phosphate 
recognition (Fig. 4e). Thus, 5’-phosphate recognition by Ago2 is dis- 
pensable for the transient binding between Dicer-2-R2D2-siRNA and 
Ago2, but is required for the extension of the dwell time. 

Our data define the following multiple fundamental steps in Ago2- 
RISC assembly (Extended Data Fig. 9). A Dicer-2-R2D2-bound siRNA 
duplex associates with but rapidly dissociates from unchaperoned Ago2. 
The chaperone machinery extends the dwell time of the Dicer-2-R2D2- 
siRNA complex on Ago2 to facilitate siRNA handover, but this extension 
requires the recognition of the guide 5’-phosphate by Ago2 (Extended 
Data Fig. 9). Previous studies suggest that, although each individual 
domain of Ago appears structurally rigid, the overall domain organi- 
zation of an RNA-free state of Ago is highly flexible, to which small 
RNA binding confers remarkable stability'®’°**?°. We envision that the 
Hsc70/Hsp90 chaperone machinery has a critical role in supporting a 
loading-competent state of Ago2, in which the 5’ nucleotide-binding 
Figure 4 | Recognition of the guide 5'-phosphate is required for dwell-time 
extension. a, b, Representative images (a) and quantification (b) of wash- 
resistant spots after incubation. 5’-Phosphate recognition was required for the 
formation of pre- and mature RISC. 5’-OH, 5’-hydroxyl group on the guide 
strand; 5'-P, 5'-phosphate; Pmut, the 5’ nucleotide-binding pocket mutant 
(Y897E); WT, wild type. Scale bar, 2 μm. The wild-type:5’-phosphate data 
are the same as in Fig. 2b, c. c, Total event numbers during continuous 
monitoring. 5’-Phosphate recognition was not necessary for the spots to 
appear. The wild-type:5’-phosphate data are the same as in Fig. 3b. d, Dwell 
time histograms. 5’-Phosphate recognition was not necessary for transient 
binding. Solid line, fitted from the cumulative plot (Extended Data Fig. 8d). 
e, Numbers of long binding events (>100 s). 5’-Phosphate recognition was 
required for dwell-time extension. The wild-type:5’-phosphate data are the 
same as in Fig. 3d. Data in b, c, e represent mean ± s.d. (n = 3). 
pocket is accessible to the 5’-phosphate of the guide strand of the 
Dicer-2-R2D2-bound siRNA duplex (Extended Data Fig. 9). Proper 
anchoring of the 5'-phosphate would trigger loading of the rest of the 
duplex, leading to successful RISC maturation coupled with closing and 
stabilization of Ago2. A future challenge will be to monitor directly a 
series of such dynamic conformational changes of Ago proteins. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

General methods. Lysis buffer, lysates from Drosophila melanogaster embryos 
and S2 cells, radiolabelled small RNA duplexes, formamide dye, and cap-labelled 
target RNAs were prepared as previously described*' °°. Native gel analysis of double- 
stranded and single-stranded RNAs and gel shift assay were performed as previ- 
ously described’!~**. Small RNAs shown in Fig. 1e and Extended Data Fig. 4a were 
purchased from GeneDesign. For single-molecule analysis, we used 5’-pUGAGG 
UAGUAGGUUGUAUAGU-3' (let-7, guide strand), 5'’-UGAGGUAGUAGGU 
UGUAUAGU-3' (let-7, guide strand, 5'-OH), and 5'-pUAUACAACCUACUAC 
CUCUCU-3’ (passenger strand) with chemical labelling of Alexa647 or 555 through 
a C6-amino linker at the 3’ end purchased from GeneDesign. The siRNA duplexes 
bore a single unpaired nucleotide at the 5’ end of the guide strand, making them highly 
asymmetric. 
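
The pairing geometry of the let-7 duplex can be checked directly from the two sequences listed above. The sketch below is an illustration, not part of the original methods: it considers Watson-Crick pairs only (G·U wobble pairs are ignored), and the helper `reverse_complement` is a name introduced here.

```python
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the Watson-Crick reverse complement of an RNA sequence."""
    return "".join(PAIR[base] for base in reversed(rna))


guide = "UGAGGUAGUAGGUUGUAUAGU"  # let-7 guide strand, 5' to 3'
passenger = "UAUACAACCUACUACCUCUCU"  # passenger strand, 5' to 3'

# Guide nucleotides 2-19 should pair with the first 18 passenger nucleotides.
core_is_paired = reverse_complement(guide[1:19]) == passenger[:18]

# Guide nucleotide 1 faces passenger nucleotide 19; a U opposite a U cannot pair.
five_prime_is_paired = PAIR[guide[0]] == passenger[18]

print("18-bp core paired:", core_is_paired)
print("guide 5' nucleotide paired:", five_prime_is_paired)
print("3' overhangs:", guide[19:], "(guide),", passenger[19:], "(passenger)")
```

The check reports a fully paired 18-nucleotide core, two-nucleotide 3′ overhangs on both strands, and an unpaired U at the guide 5′ end, consistent with the asymmetric design described above.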

Plasmid constructions. pCold-Hsp83, Hsp83(E35A), Hsc70-4, Hsc70-4(D206S), 
Hop, Droj2 and p23. DNA fragments containing hsp83, hop, droj2, p23 (cg16817), 
hsc70-4 or hsc70-4(D206S) genes were amplified by PCR from S2 cell cDNA, pENTR 
WT Hsc70-4 or pENTR Hsc70-4(D206S) (ref. 7) and cloned into pColdI (Takara) 
with NdeI-EcoRI or NdeI-HindIII restriction enzyme sites. The E35A mutation in 
the hsp83 gene was introduced by QuikChange site-directed mutagenesis 
(Stratagene). 

pASW-Hop. A DNA fragment containing the hop gene was amplified from S2 cell 
cDNA and cloned into pENTR/D-TOPO (Invitrogen) according to the manufac- 
turer’s instruction, followed by recombination with pASW destination vector’ with 
Gateway LR Clonase II (Invitrogen). 

pAFW-Ago2 wild type, D965A and Y897E. pAFW-Ago2 wild type was described 
previously’. D965A and Y897E mutations were introduced by QuikChange site- 
directed mutagenesis (Stratagene). 

pAHaloFW-Ago2 wild type, D965A and Y897E. A DNA fragment containing the 
Halo tag and TEV protease recognition sequence from pFN18A (Promega) was 
inserted into pAFW (Drosophila Gateway vector collection, Actin 5C promoter 
and N-terminal 3×Flag tag) at the 5’ end upstream of the 3×Flag-tag sequence by 
In-Fusion HD (Clontech), to construct the pAHaloFW destination vector. pENTR- 
Ago2 wild type was a gift from M. Horwich and P. D. Zamore. pDONR-Ago2 
D965A and Y897E were constructed by recombination with pDONR/Zeo 
(Invitrogen) and pAFW-Ago2 D965A or Y897E by BP Clonase II (Invitrogen). 
pENTR-Ago2 wild type, pDONR-Ago2 D965A and Y897E were recombined with 
pAHaloFW to construct pAHaloFW-Ago2 wild type, D965A and Y897E, respectively. 

pAWH-Dicer-2 and pASW-R2D2. DNA fragments containing the dicer-2 or r2d2 
gene were amplified by PCR from pFastBac-Dicer-2 or pFastBac-R2D2 (refs 3, 27), 
which were gifts from Q. Liu, and cloned into pENTR/D-TOPO (Invitrogen) or 
pENTR/TEV/D-TOPO (Invitrogen), followed by recombination with pAWH 
(Drosophila Gateway vector collection) or pASW destination vector’ with Gate- 
way LR Clonase II (Invitrogen). 

DNA oligonucleotides for plasmid construction are as follows. Hsp83, 5’-AAA 
CCCCATATGATGCCAGAAGAAGCAGAGA-3' and 5'-AAACCCGAATTCTT 
AATCGACCTCCTCCATG-3’; Hsc70-4, 5'-AAACCCCATATGATGTCTAAA 
GCTCCTGCTG-3’ and 5’-AAACCCGAATTCTTAGTCGACCTCCTCGATG-3’; 
Hop for pColdI, 5’-AAACCCCATATGATGGACAAGGTGAACGAAC-3’ and 
5'-AAACCCGAATTCCTAGTGAATCTGAATGATGCC-3’; Droj2, 5’-AAACC 
CCATATGATGATGGTTAAGGAGACTGGATA-3’ and 5’-AAACCCGAATT 
CTTAACTCGATGTGCACTGC-3’; p23, 5’-AAACCCCATATGATGTCGGCA 
GCAGCAGG-3' and 5'-AAACCCAAGCTTCTAGGCAGCTGGCTTCTT-3’; Hsp83 
E35A, 5’-GGAGATTTTCCTGCGCGCGTTGATCTCGAACGCTTC-3’ and 5'-G 
AAGCGTTCGAGATCAACGCGCGCAGGAAAATCTCC-3’; Hop for pENTR, 
5'-CACCATGGACAAGGTGAACGAAC-3’ and 5'-CTAGTGAATCTGAATG 
ATGCC-3'; Ago2 D965A, 5'-ACACCATGTACATTGGAGCCGCCGTGACCC 
ATCCCTCTCCCGA-3’ and 5’-TCGGGAGAGGGATGGGTCACGGCGGCTC 
CAATGTACATGGTGT-3’; Ago2 Y897E, 5'-ATTCCTCAGTTTAGAATTTCT 
GAAGATACAATTAAGCAGAAGGCC-3’ and 5'-GGCCTTCTGCTTAATTG 
TATCTTCAGAAATTCTAAACTGAGGAAT-3’; Dicer-2, 5’-CACCATGGAA 
GATGTGGAAATCAA-3’ and 5'-TTAGGCGTCGCATTTGCT-3’; and R2D2, 
5'-CACCATGGATAACAAGTCAGCCG-3’ and 5’-TTAAATCAACATGGTG 
CGAAA-3’. 
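
Two properties of the oligonucleotides listed above can be verified mechanically: the pColdI cloning primers should carry the stated restriction sites, and each pair of mutagenic primers should be mutually reverse-complementary. The sketch below checks a subset of the primers as an illustration; the variable and helper names are introduced here, and the recognition sequences used are the standard ones for NdeI, EcoRI and HindIII.

```python
DNA_PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(DNA_PAIR[base] for base in reversed(dna))


# Standard recognition sequences of the enzymes named in the cloning strategy.
SITES = {"NdeI": "CATATG", "EcoRI": "GAATTC", "HindIII": "AAGCTT"}

# Cloning primers, 5' to 3', transcribed from the list above.
hsp83_fwd = "AAACCCCATATGATGCCAGAAGAAGCAGAGA"
hsp83_rev = "AAACCCGAATTCTTAATCGACCTCCTCCATG"
p23_rev = "AAACCCAAGCTTCTAGGCAGCTGGCTTCTT"

# Mutagenic primer pair for Hsp83 E35A.
e35a_fwd = "GGAGATTTTCCTGCGCGCGTTGATCTCGAACGCTTC"
e35a_rev = "GAAGCGTTCGAGATCAACGCGCGCAGGAAAATCTCC"

site_checks = {
    "Hsp83 forward has NdeI": SITES["NdeI"] in hsp83_fwd,
    "Hsp83 reverse has EcoRI": SITES["EcoRI"] in hsp83_rev,
    "p23 reverse has HindIII": SITES["HindIII"] in p23_rev,
}
mutagenic_pair_ok = reverse_complement(e35a_fwd) == e35a_rev

for label, ok in site_checks.items():
    print(f"{label}: {ok}")
print("E35A primers are reverse complements:", mutagenic_pair_ok)
```

The same test can be applied to the Ago2 D965A and Y897E primer pairs.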

Protein purifications. Recombinant proteins including Hsp83, Hsp83(E35A), 
Hop, Hsc70-4, Hsc70-4(D206S), Droj2 and p23 were expressed as N-terminal 
His-tagged proteins in Escherichia coli BL21 strain. Typically, the cells in 2-litre 
cultures were cultivated to an OD600 nm of 0.5 at 37 °C with 100 μg ml⁻¹ ampicillin 
and then grown at 15 °C overnight with 1 mM isopropyl-β-D-thiogalactoside (IPTG). 
The cell pellets were resuspended in His A buffer (30 mM HEPES-KOH pH 7.4, 
200 mM KOAc, 2 mM Mg(OAc)2, 5% glycerol, 20 mM imidazole, 0.2 mM TCEP) 
containing 1× EDTA-free protease inhibitor cocktail (Roche), sonicated, and centrifuged 
at 10,000g for 20 min. The supernatant was loaded onto HisTrap FF crude 
5 ml (GE Healthcare) and eluted with a linear gradient from His A buffer to His B 
buffer (His A buffer containing 400 mM imidazole). The peak fractions were collected, 
diluted ten times with 30 mM HEPES-KOH pH 7.4, 2 mM Mg(OAc)2, 5% 
glycerol, 1 mM dithiothreitol (DTT), loaded onto MonoQ 1 ml (GE Healthcare), 
and eluted with a linear gradient from MonoQ A buffer (30 mM HEPES-KOH pH 
7.4, 20 mM KCl, 2 mM Mg(OAc)2, 5% glycerol, 1 mM DTT) to MonoQ B buffer 
(MonoQ A buffer containing 1 M KCl). The peak fractions were collected and 
buffer-exchanged to 30 mM HEPES-KOH pH 7.4, 100 mM KOAc, 2 mM Mg(OAc)2, 
10% glycerol, 1 mM DTT with PD-10 (GE Healthcare). For Hsp83 recombinant 
protein purification, glycerol and reducing reagent were omitted from all the pro- 
cedures. The expression and purification of the complex of streptavidin-binding 
peptide (SBP)-Hop, endogenous Hsp83, and endogenous Hsc70-4 were performed 
according to the method previously described with some modifications**””. Briefly, 
S2 cells (1.0 × 10⁶ cells ml⁻¹) were transfected with 10 μg pASW-Hop and 20 μl 
X-tremeGENE HP (Roche) per 10 ml and cultured for 72h. Transfected S2 cells 
were collected and washed once with PBS. Cell pellets were resuspended with twice 
the volume of hypotonic lysis buffer (10 mM HEPES-KOH pH 7.4, 10 mM KCl, 
1.5 mM MgCl2, 1× EDTA-free protease inhibitor cocktail (Roche)), incubated for 
30 min, and centrifuged at 17,000g for 20 min. Typically, 2 ml supernatant was 
incubated with 200 μl Streptavidin Sepharose HP (GE Healthcare) for 1 h. The 
beads were washed five times with lysis buffer and SBP-Hop and its binding proteins 
were eluted with 100 μl lysis buffer containing 5 mM biotin for 30 min. The 
proteins around 85 kDa and 70 kDa (Extended Data Fig. 1a) were identified as 
Hsp83 and Hsc70-4, respectively, by mass spectrometry as previously described’. 
Recombinant Dicer-2-R2D2 heterodimer was prepared by plasmid-directed expression 
of SBP-R2D2 and Dicer-2 in S2 cells***’. 7.5 μg pAWH-Dicer-2 and 7.5 μg 
pASW-R2D2 were co-transfected per 10 ml of S2 cell culture by 30 μl X-tremeGENE 
HP (Roche). After incubation with lysate, the beads were washed five times with lysis 
buffer containing 0.8 M NaCl, and rinsed twice with lysis buffer containing 30% 
glycerol and 1 mM DTT. The Dicer-2-SBP-R2D2 heterodimer was eluted with 
lysis buffer containing 2.5 mM biotin, 30% glycerol and 1 mM DTT. All purifica- 
tion steps were performed at 4 °C. Column chromatography was performed using 
ÄKTA purifier (GE Healthcare). All purified proteins were shock-frozen in liquid 
nitrogen and stored at −80 °C. 
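
One way to read the tenfold dilution before the MonoQ step is as a salt adjustment. The sketch below works through the arithmetic under two stated assumptions that are not spelled out in the text: the diluent contributes no KOAc or imidazole (none is listed among its components), and the peak fraction carries at most the 400 mM imidazole of His B buffer. The helper `dilute` is a name introduced here.

```python
def dilute(conc_mM: float, fold: float) -> float:
    """Concentration after diluting into a buffer that lacks the solute."""
    return conc_mM / fold


fold = 10.0
koac_after = dilute(200.0, fold)  # KOAc carried over from His A/B buffer
imidazole_max_after = dilute(400.0, fold)  # upper bound: fraction eluted in pure His B

print(f"KOAc after dilution: {koac_after:.0f} mM")
print(f"imidazole after dilution: at most {imidazole_max_after:.0f} mM")
```

Under these assumptions the diluted sample contains about 20 mM KOAc, comparable to the 20 mM KCl of MonoQ A buffer, which would be consistent with loading at the low-salt end of the anion-exchange gradient.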

Small RNA pull-down, target cleavage and northern blot analysis. Typically, 
for 15 reactions, 30 μl Dynabeads Protein G (Invitrogen) were equilibrated with lysis 
buffer, incubated with 1 μl of 1 mg ml⁻¹ anti-Flag M2 antibody (F1804, Sigma) at 
4 °C for 30 min, washed three times with lysis buffer, and incubated at 4 °C for 1 h 
with 30 μl lysate from S2 cells in which pAFW-Ago2 wild type, D965A or Y897E 
was expressed. For Extended Data Fig. 3b-d, 5 μl lysate was pre-incubated with 
5 nM ³²P-radiolabelled CXCR4 siRNAs, 1 mM ATP, 25 mM creatine monophosphate 
(Fluka), 0.03 U μl⁻¹ creatine kinase (Calbiochem), 0.1 U μl⁻¹ RNasin Plus 
(Promega) and lysis buffer in 10 μl at 25 °C for 2 h. The beads were washed five 
times with wash buffer (lysis buffer containing 0.8 M NaCl and 1% Triton X-100) 
and rinsed with lysis buffer. Typically, a reaction contained approximately 40 fmol 
Ago2 protein. The beads were incubated in lysate or the reconstituted system with 
20 nM ³²P-radiolabelled small RNA duplex, 1 mM ATP, 25 mM creatine monophosphate, 
0.03 U μl⁻¹ creatine kinase, 0.1 U μl⁻¹ RNasin Plus and lysis buffer in 
10 μl at 25 °C for 90 min. Five microlitres (Extended Data Fig. 1b, e) and 1 μl (Extended 
Data Fig. 2d-f) of ago2⁴¹⁴ embryo lysate were used. PES (Sigma) and Celastrol 
(Sigma) were dissolved in dimethylsulphoxide (DMSO) and added into the reactions 
in Extended Data Fig. 1e, providing the final concentration of 3 mM PES and 
3 mM Celastrol with 3% DMSO. Twenty nanomolar Dicer-2-R2D2, 2.2 μM Hsp83- 
Hop-Hsc70-4 complex, 8.4 μM Droj2 and 2.2 μM p23 were used in Fig. 1a, d and 
Extended Data Figs 3, 6b and 8b, c. Twenty nanomolar Dicer-2-R2D2, 4.4 μM 
Hsp83 or Hsp83(E35A), 2.2 μM Hsc70-4 or Hsc70-4(D206S), 2.2 μM Hop, 8.4 μM 
Droj2 and 2.2 μM p23 were used in Fig. 1b, c, e, f and Extended Data Fig. 2. In 
Fig. 1a-c, f and Extended Data Figs 2a-c, g, h and 4d, GST equimolar to the 
omitted protein was added to normalize the total protein concentration in 
the reactions. In Extended Data Fig. 2g, h, 20 nM ³²P-radiolabelled single-stranded 
guide strand was used with 100 μg ml⁻¹ yeast tRNA (Sigma). After incubation, the 
beads were washed four times with wash buffer. Bound RNAs were purified by Pro- 
teinase K treatment and run on 12% native polyacrylamide gel to separate double- 
stranded and single-stranded RNAs as previously described*'. The protein levels 
of Flag-Ago2, Dicer-2, R2D2 and the chaperone components were unaffected by 
PES, Celastrol, non-hydrolysable ATP analogues or omission or mutation of a 
component(s). For cleavage assay, beads were further rinsed with lysis buffer twice 
and incubated with 5 nM cap-labelled 182-nucleotide target RNA for Fig. 1c and 
Extended Data Fig. 2c, or 5 nM cordycepin-labelled 28-nucleotide target RNA (5'- 
GAUACUAUACAACCUACUACCUCAACCU-3’) for Extended Data Fig. 2e in 
lysis buffer containing 4 mM Mg(OAc)2. The reaction was quenched by formamide 
dye and run on 8% and 15% denaturing polyacrylamide gel for 182-nucleotide and 
28-nucleotide target RNA, respectively. Estimation of RISC concentration by pre- 
steady-state ‘burst’ kinetics was performed as previously described**. For northern 
blot, RNAs were run on 15% denaturing gel, transferred and chemically cross- 
linked to Hybond-N (GE Healthcare) by 1-ethyl-3-(3-dimethylaminopropyl) car- 
bodiimide (EDC)*”°. RNA probe 5’-GGAGCGAACUUGUUGGAGUCAA-3’ for 
esi2.1 and 2'-O-Me probe 5'-UCUUCACUAUACAACCUACUACCUCAACC 
UU-3’ for let-7 were labelled by PNK (Takara) at the 5’ end and hybridized with 
PerfectHyb plus (Sigma) at 45 °C and 68 °C, respectively. The images were acquired 
by a FLA-7000 imaging system (Fujifilm Life Sciences). 

Western blot analysis. Anti-Flag M2 (F1804, Sigma) (1:5,000), anti-tubulin (B-5- 
1-2, Sigma) (1:10,000), anti-Ago2 (ref. 18) (1:100) and anti-HaloTag (G9211, Promega) 
(1:1,000) antibodies were used as the primary antibody. Signal Enhancer HIKARI 
(Nacalai tesque) was used for Ago2 detection. Chemiluminescence was induced by 
SuperSignal West Dura substrate (Pierce) or Luminata Forte Western HRP Sub- 
strate (Millipore) and images were acquired by a LAS-3000 imaging system (Fujifilm 
Life Sciences). 

Protein labelling with Halo-tag ligands. For visualization of Halo-tagged protein 
(Extended Data Fig. 6a), 2 µl lysate from S2 cells, in which pAHaloFW-Ago2 wild 
type, D965A or Y897E was expressed, was incubated with 1 µM HaloTag TMR 
ligand (G8251, Promega) and 10% DMSO in 10 µl of lysis buffer at 25 °C for 
30 min. After SDS-PAGE, the labelled proteins were visualized by a LAS-3000 
image system. For single-molecule analysis, 400 µl fresh lysate was incubated with 
1 µM HaloTag PEG-biotin ligand (G8591, Promega) and 1% DMSO at 25 °C for 
30 min. Then, free ligands were removed by NAP-5 column (GE Healthcare) and 
equilibrated with lysis buffer containing 1 mM DTT. Flow-through fraction 
(~480 µl) was divided into tubes, 10 µl each, shock-frozen in liquid nitrogen, 
and stored at −80 °C. 

Preparation of the observation chamber for single-molecule imaging. The obser- 
vation chamber was prepared essentially as previously described***’. Halo-Ago2 
was immobilized on the surface as illustrated in Fig. 2a and Extended Data Fig. 5. 
Typically, 20 µl of diluted S2 cell lysate containing biotin-labelled Halo-Ago2 or 
naive S2 cell lysate as a mock was infused into the flow chamber. Then, the glass 
surface was washed with 50 µl of wash buffer twice, rinsed with 50 µl of lysis buffer 
three times, and 20 µl of lysis buffer containing 1% Biolipidure-203 (NOF Cor- 
poration). All experiments were performed at 25 ± 2 °C. 

Reaction mixtures and buffers for single-molecule imaging. For snapshot obser- 
vation, the reaction mixture contained 20 nM siRNA duplex, of which each strand 
was labelled with 3’ Alexa fluorescent dye, 20 nM Dicer-2-R2D2, 1.2 µM Hsp83- 
Hop-Hsc70-4 complex, 0.75 µM Droj2, 2.7 µM p23, 1 mM ATP, 25 mM creatine 
monophosphate, 0.03 U µl⁻¹ creatine kinase, 0.1 U µl⁻¹ RNasin Plus, O₂ scavenger 
system (6.25 U ml⁻¹ glucose oxidase (G2133, Sigma), 62.5 U ml⁻¹ catalase (C40, 
Sigma) and 5.63 mg ml⁻¹ glucose), 1% Biolipidure-203 and 2 mM Trolox (238813, 
Sigma) in 20 µl of lysis buffer. For continuous monitoring, the reaction mixture 
contained 10 nM siRNA duplex, of which each strand was labelled with 3’ Alexa fluo- 
rescent dye, 20 nM Dicer-2-R2D2, 1.2 µM Hsp83-Hop-Hsc70-4 complex, 0.75 µM 
Droj2, 2.7 µM p23, 1 mM ATP, 25 mM creatine monophosphate, 0.03 U µl⁻¹ 
creatine kinase, 0.1 U µl⁻¹ RNasin Plus, O₂ scavenger system, 1% Biolipidure-203, 
50 µg ml⁻¹ yeast tRNA (R5636, Sigma) and 2 mM Trolox in 20 µl of lysis buffer. 
The observation mixture contained O₂ scavenger system and 2 mM Trolox in 
20 µl of lysis buffer. 

General image acquisition procedures. Single-molecule images were visualized 
by a total internal reflection fluorescence microscope equipped on an inverted type 
microscope (IX71, Olympus) with a ×150 oil immersion objective lens (UAPON 
150 X OTIRFM, NA 1.45, Olympus), as previously described***'. The Alexa555 
and 647 dyes were illuminated simultaneously with an argon (Ar) laser (514 nm; 
35LAP321, Melles Griot) and a helium-neon (He-Ne) laser (633 nm; GLG5410, 
SOC), respectively. Fluorescence images from Alexa555 and 647 were separated by 
using a DualView2 (Optical Insights) and then projected side-by-side onto a back- 
illuminated electron-multiplying charge-coupled device (EMCCD) camera (iXon3 
DU-897E-CSO-#BV, 512 × 512 pixels, Andor Technology). Images are pseudo- 
coloured in green (Alexa555) or red (Alexa647) in the figures. 

General data analysis procedures for single-molecule imaging. Images were ana- 
lysed using Image J software (http://rsb.info.nih.gov/ij/) with a built-in function 
and custom-designed plug-in software. The fluorescent intensity of the spots was 
measured using 6-pixels diameter circular regions of interest (ROIs), and only spots 
with fluorescence intensities greater than 3σ of the background intensity were 
analysed in order to exclude shot noise. All graphs and fitting curves were gener- 
ated using KaleidaGraph (Synergy Software) and Igor Pro (Wavemetrics). 
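
The intensity cut-off described above can be sketched in a few lines. This is only an illustration, not the authors' ImageJ plug-in: it assumes the threshold is the background mean plus k standard deviations of the background, and the function name `select_spots`, the parameter `k` and the toy intensity values are all hypothetical.

```python
import statistics

def select_spots(spot_intensities, background_samples, k=3.0):
    """Keep spots brighter than background mean + k * background s.d.

    Assumption: the cut-off is defined relative to the spread (s.d.) of
    background ROI intensities; k = 3 is an illustrative default.
    """
    mu = statistics.mean(background_samples)
    sigma = statistics.stdev(background_samples)  # sample standard deviation
    cutoff = mu + k * sigma
    return [s for s in spot_intensities if s > cutoff]

# Toy values (arbitrary units), not data from the paper.
background = [100, 102, 98, 101, 99]
spots = [103, 110, 250, 101]
bright = select_spots(spots, background)
```

With these toy numbers the cut-off is about 104.7, so only the two brightest ROIs are retained.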

Measuring the steps and time constants of photobleaching. For measuring the 
steps and time constants of photobleaching (Extended Data Fig. 10a-c), 10 µl lysate 
from S2 cells, in which pAHaloFW-Ago2 D965A was expressed, was pre-incubated 
with 20 nM siRNA duplex, of which the guide strand was labelled with 3’ Alexa647 
and the passenger strand with 3’ Alexa555, 1 mM ATP, 25 mM creatine monopho- 
sphate, 0.03 U µl⁻¹ creatine kinase, 0.1 U µl⁻¹ RNasin Plus and lysis buffer in 20 µl 
at 25 °C for 30 min. The mixture was then incubated with 1 µM HaloTag PEG-biotin 
ligand and 1% DMSO at 25 °C for 30 min, and diluted with lysis buffer to 50 µl. 
Free ligands were removed by G-25 spin column (Invitrogen) equilibrated with 
lysis buffer. For measuring the steps of photobleaching, the flow-through fraction 
was diluted ~10-fold before infusion. The flow-through fraction was infused into 
the observation chamber, and after washing the chamber by wash buffer, the obser- 
vation mixture was infused into the chamber. Then six random and non-overlapping 
places were chosen and imaged continuously for 600-1,800 s at a frame rate of 
1 frame s⁻¹ with either of the excitation lasers at three power densities (160, 320 
and 640 mW mm⁻² for the Ar laser and 3.125, 6.25 and 12.5 W mm⁻² for the He- 
Ne laser). Three independent chambers were used for imaging. For measuring time 
constants of photobleaching, the mean fluorescence in each image was plotted 
versus time, and the resulting curve yielded a time constant of the photobleaching 
of each condition. We then plotted the photobleaching time constants for Alexa555 
and 647 versus the inverse of power density. 
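
One way to extract a photobleaching time constant from a mean-fluorescence time course is sketched below. It assumes a background-subtracted, single-exponential decay I(t) = I0·exp(−t/τ) and uses a log-linear least-squares fit; the text does not state which model or fitting routine was used, so the function `fit_time_constant` and the synthetic trace are illustrative only.

```python
import math

def fit_time_constant(times, intensities):
    """Fit ln(I) = ln(I0) - t / tau by ordinary least squares; return tau.

    Assumes strictly positive, background-subtracted intensities.
    """
    logs = [math.log(i) for i in intensities]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx          # slope equals -1 / tau
    return -1.0 / slope

# Synthetic decay with tau = 200 s sampled every 10 s (not measured data).
times = list(range(0, 600, 10))
trace = [1000.0 * math.exp(-t / 200.0) for t in times]
tau = fit_time_constant(times, trace)
```

Repeating the fit at each power density and plotting τ against the inverse of power density would then give the dependence described in the text.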

Snapshot observation of wash-resistant spots. For snapshot observation (Figs 2b, c, 
4a, b and Extended Data Figs 6e-h and 10d, e), S2 cell lysate containing biotin- 
labelled Halo-tagged Ago2 was diluted fivefold with lysis buffer just before 
immobilization to ensure the discreteness of the fluorescent spots. To measure 
roughly the time course of spot appearance (Extended Data Fig. 6c), the reaction 
mixture (see earlier) was infused into the observation chamber, and then three ran- 
dom and non-overlapping places were chosen and imaged from one flow chamber at 
each time point for 1 s at a frame rate of 10 frames s⁻¹ at a power density of 
2.5 W mm⁻² for the Ar laser and 360 mW mm⁻² for the He-Ne laser. Images were 
averaged over 10 frames and deconvoluted, and the total number of spots in the 
three images was counted for each time point. For snapshot observation of wash- 
resistant spots, the reaction mixture was infused into the observation chamber and 
incubated for 1h. After washing the chamber by wash buffer, the observation 
mixture was infused into the chamber, and three random and non-overlapping 
images were taken from each chamber for 1 s as described earlier. For each con- 
dition, at least three independent experiments were performed and the total num- 
ber of spots within 380 × 190 pixels (800 µm²) in the centre of the view field in 
every three images taken from one flow chamber was counted. The images were 
averaged, deconvoluted, and collectively analysed as described earlier. It is noted that 
essentially the same results were obtained regardless of deconvolution. 
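
The stated counting window (380 × 190 pixels, 800 µm²) implies an effective pixel size, which the short sketch below works out and then uses to express a spot count as a surface density. Square pixels are assumed, and the function names and the example count of 40 spots are hypothetical.

```python
import math

def pixel_size_um(width_px, height_px, area_um2):
    """Edge length of one (assumed square) pixel in micrometres."""
    return math.sqrt(area_um2 / (width_px * height_px))

def spot_density(n_spots, area_um2):
    """Spots per square micrometre within the counting window."""
    return n_spots / area_um2

# Counting window quoted in the text: 380 x 190 pixels covering 800 um^2.
px = pixel_size_um(380, 190, 800)      # roughly 0.105 um per pixel
density = spot_density(40, 800)        # 40 is an illustrative count
```

An effective pixel size of roughly 105 nm is consistent with a ×150 objective projecting onto a camera with pixels of about 16 µm, although the camera pixel pitch is not given in the text.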

Snapshot observation after protease treatment. For snapshot observation after 
protease treatment (Extended Data Fig. 6e, f), the protease mixture containing 1 U µl⁻¹ 
TurboTEV (T0102S, Accelagen) or 0.1 U µl⁻¹ Turbo3C (H0101S, Accelagen) pro- 
tease in 20 µl of lysis buffer was infused into the wild-type Ago2-tethered chamber, 
which had been incubated with the reaction mixture containing all of the factors 
and imaged, and incubated for 30 min. After incubation, the chamber was washed 
and imaged as described earlier. 

Continuous monitoring of spot appearance. S2 cell lysate containing biotin- 
labelled Halo-tagged Ago2 was diluted tenfold (Figs 3a-d, 4c-e and Extended 
Data Figs 7a, 8d and 10f, g) or fivefold (Fig. 3e-h and Extended Data Fig. 7b-e; 
twice-denser condition to determine the time constant for RISC maturation) with 
lysis buffer just before immobilization. For the WT+TEV condition (Fig. 3b, d), 
lysate was incubated with 1 U µl⁻¹ TurboTEV for 30 min in solution and then 
diluted. Immediately after 20 µl of the reaction mixture (see earlier) was infused 
into the observation chamber, images were continuously taken for 1,200 s at a frame 
rate of 1 frame s⁻¹ at a power density of 250 mW mm⁻² for the Ar laser and 
36 mW mm⁻² for the He-Ne laser. Stage drift was corrected using slice alignment 
plugin”. For each condition, at least three independent experiments were performed. 
To analyse single-molecule association and dissociation events, we first created a 
maximum intensity projection over all 1,200 frames in stack. Then, we picked the 
Alexa555 and 647 colocalized fluorescent spots within 380 × 190 pixels (800 µm²) 
in the centre of the view field of each split image taken from one chamber. In 
Fig. 3e-h and Extended Data Fig. 7b-e, we analysed the spots that displayed within 
the entire view field (512 × 256 pixels; 1,450 µm²). We then generated integrated 
intensity traces to identify concurrent landing events of Alexa555 and 647. 
Duration times (t — t,) were determined by finding the appearance time of the 
simultaneous Alexa555 and 647 signals (t,) and subtracting it from the disappear- 
ance time of either or both signals (t). 
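
The duration-time definition above can be illustrated with a toy two-channel trace. The sketch below is a simplification, not the authors' analysis code: it takes the first frame in which both channels exceed a threshold as the appearance time and the first later frame in which either channel falls below it as the disappearance time. The function `dwell_time`, the threshold and the traces are hypothetical.

```python
def dwell_time(green, red, threshold, frame_interval_s=1.0):
    """Return (appearance, disappearance, duration) in seconds.

    Appearance: first frame where both channels are above threshold.
    Disappearance: first later frame where either channel drops below it;
    if that never happens, the end of the trace is used (a censored event).
    Returns None when no concurrent signal is found.
    """
    both_on = [g > threshold and r > threshold for g, r in zip(green, red)]
    if True not in both_on:
        return None
    start = both_on.index(True)
    end = len(both_on)
    for i in range(start, len(both_on)):
        if not both_on[i]:
            end = i
            break
    t_on = start * frame_interval_s
    t_off = end * frame_interval_s
    return t_on, t_off, t_off - t_on

# Toy traces at 1 frame per second: red disappears one frame before green.
green = [0, 0, 9, 9, 9, 9, 0, 0]
red = [0, 0, 8, 8, 8, 0, 0, 0]
result = dwell_time(green, red, threshold=5)
```

In real traces a single threshold would be sensitive to noise, so some smoothing or step detection would normally precede this kind of event calling.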

Fluorescence intensity of single-molecule benchmarks. For measuring fluor- 
escence intensity of single-molecule benchmarks (Extended Data Fig. 10e, g), we 
prepared S2 cell lysate containing biotin-labelled Halo-tagged Ago2(D965A) pro- 
grammed with fluorescently labelled siRNA duplex, as described earlier. Three kinds 
of fluorescently labelled duplex (guide/passenger = Alexa647/555, 647/647 and 555/ 
555) were used for programming. The prepared lysate was diluted fivefold with lysis 
buffer, and was infused into the observation chamber. After washing the chamber by 
wash buffer, images were taken and analysed under the same acquisition condition 
and procedure for snapshot observation or continuous monitoring. Three independ- 
ent chambers were used for imaging for each fluorescently labelled duplex. 

Extended Data Figure 1 | Validation of small RNA pull-down assay. 
a, CBB staining of recombinant proteins used in this study. b, Small RNA 
pull-down assay using immunopurified wild type (WT), catalytic mutant 
(Cmut; D965A) and 5’ nucleotide-binding pocket mutant (Pmut; Y897E) of 
Ago2. Ago2 alone (Buffer) could not load siRNA, but supplementation of 
lysate from ago2⁴¹⁴ mutant flies directed efficient formation of pre- and 
mature Ago2-RISC. The catalytic mutant Ago2 predominantly formed 
pre-Ago2-RISC, while the 5’ nucleotide-binding pocket mutant failed to load 
siRNA. c, Western blotting of Ago2 and tubulin in lysate from S2 cells, 
wild-type embryo and ago2⁴¹⁴ mutant embryo. d, Anti-Flag western blotting 
of wild-type and mutant Ago2 proteins in lysate. All the constructs are 
expressed at similar levels. e, Hsp70 inhibitor (PES) and p23 inhibitor 
(Celastrol) block duplex loading into Ago2. 

Extended Data Figure 2 | In vitro reconstitution of Ago2-RISC assembly 
by seven recombinant proteins. a, Quantification of Fig. 1a. b, Quantification 
of Fig. 1b. c, Time course of target cleavage in the reconstituted system. 
Figure 1c represents the data at 30 min. d, Activities of the reconstituted system 
and ago2⁴¹⁴ mutant lysate. e, Cleavage assay in a multiple turnover condition 
by Ago2-RISCs assembled in the reconstituted system or ago2⁴¹⁴ mutant 
embryo lysate (d). ago2⁴¹⁴ lysate was diluted to normalize the amount of the 
assembled Ago2-RISC. f, Estimated concentrations of mature single-strand- 
containing Ago2-RISCs by the pull-down assay (d) and the pre-steady-state 
‘burst’ kinetics of target cleavage (e). Owing to a technical limitation in the 
quantification of small amounts of radiolabelled RNAs, the concentrations 
of RISCs estimated by the burst kinetics were higher than those estimated by the 
pull-down assay. However, the ratios between them were essentially identical 
between RISCs assembled in the reconstituted system and ago2⁴¹⁴ mutant 
lysate, suggesting that the reconstituted Ago2-RISC is as fully active as the one 
assembled in lysate. g, Dicer-2-R2D2 and the chaperone machinery are 
dispensable for loading of single-stranded RNAs whereas they are essential for 
duplex loading. h, Quantification of g. Data represent mean ± s.d. (n = 3). 

Extended Data Figure 3 | Unloading is not accompanied by loading. a, Quantification of Fig. 1d. b, Experimental strategy for c. c, The amount of pre-loaded 
CXCR4 siRNA did not change during new loading of let-7 siRNA. d, Quantification of c. Red, reconstituted system; black, GST. Data represent mean ± s.d. (n = 3). 

Extended Data Figure 4 | The reconstituted system faithfully recapitulates 
siRNA loading asymmetry and ATP dependency. a, Ago2-RISC was 
assembled with siRNAs with either the guide strand or the passenger strand 
radiolabelled. Left, asymmetry-flipped let-7 siRNA. In contrast to the normal 
let-7 siRNA (Fig. 1e), the let-7 sequence in this siRNA duplex serves as the 
passenger strand due to the introduction of a mismatch at the opposite end. 
Middle, highly asymmetric CXCR4 siRNA. Right, relatively symmetric luc 
siRNA, in which the two strands can serve as the guide strand with similar 
chances. b, Small RNA pull-down assay was performed in the reconstituted 
system with 0.5 mM ATP, ATP-γS, AMP-PNP or ADP. c, Quantification 
of b. d, Quantification of Fig. 1f. 

Extended Data Figure 5 | Experimental scheme for single-molecule 
analysis. a-c, Halo-TEV-Flag-Ago2 was tethered to the surface as illustrated. 
For snapshot observation (a), the reaction mixture was infused into the 
observation chamber and incubated for 1 h. After washing of the surface, the 
images of the wash-resistant spots were acquired for 1 s. For snapshot 
observation after protease treatment (b), the protease mixture was infused 
into the chamber, which had been imaged and incubated for 30 min. For 
continuous monitoring (c), images were acquired for 20 min without washing 
immediately after the reaction was started on the surface. 

Extended Data Figure 6 | Halo-tagged Ago proteins and fluorescently 
labelled siRNAs used in single-molecule analysis. a, Visualization of 
Halo-TEV-Flag-tagged wild type, catalytic mutant (Cmut; D965A) and 
5’ nucleotide-binding pocket mutant (Pmut; Y897E) of Ago2 by a Halo-TMR 
ligand. b, RNA pull-down assay with siRNAs with or without 3’ fluorescent 
modifications. Red, guide strand; black, passenger strand. The 5'-phosphate of 
the guide strand was radiolabelled as denoted by yellow circles. Fluorescent 
labelling did not compromise the efficiency of RISC assembly in vitro. c, Time 
course of the guide-alone (red) or double-stranded (yellow) spot formation for 
wild-type Ago2 with the reconstituted system. d, TEV-protease-mediated 
cleavage of Halo-TEV-Flag-Ago2. After S2 lysate containing Halo-TEV-Flag- 
Ago2 was incubated with proteases (HRV3C or TEV) or lysis buffer for 30 min, 
western blotting was performed using anti-Halo and anti-Flag antibodies. 
e, f, Representative single-molecule images (e) and quantification (f) after 
protease treatment for 30 min (Extended Data Fig. 5b). Wash-resistant spots 
observed in the WT, +D/R and +Cpn conditions (Fig. 2b, c) largely 
disappeared after TEV-protease-mediated cleavage but not after the control 
treatment of HRV3C protease. The colour scheme is the same as in Fig. 2b, c. 
g, h, Validation of pre- and mature RISC formation with inversion of the strand 
colours. Representative single-molecule images (g) and quantification (h) of 
wash-resistant spots after 60 min incubation with the reconstituted system 
together with an siRNA with a 3’ Alexa555 (green)-labelled guide strand and 
a 3’ Alexa647 (red)-labelled passenger strand. Data represent mean ± s.d. 
(WT in e, n = 4; others, n = 3). 

Extended Data Figure 8 | The 5’-phosphate of the guide strand is 
dispensable for binding to Dicer-2-R2D2 but is essential for duplex loading 
into Ago2. a, Gel shift assay with increasing concentrations of recombinant 
Dicer-2-R2D2 and passenger-strand-radiolabelled siRNA duplex with the 
5'-phosphate (left) or 5’-hydroxyl group (right) on the guide strand. The 
5'-phosphate of the guide strand is dispensable for the interaction with 
Dicer-2-R2D2. Red, guide strand; black, passenger strand. b, c, Small RNA 
pull-down assay with siRNA duplexes containing 5'-phosphate (left) or 
5'-hydroxyl group (right) on the guide strand with wild-type or the catalytic 
mutant Ago2. The 3’ end of the guide strand (b) or the 5’ end of the passenger 
strand (c) of the siRNA duplex was radiolabelled, as denoted by yellow circles. 
The 5’-phosphate on the guide strand is essential for duplex loading into 
Ago2; in the absence of the guide 5’-phosphate, the siRNA duplex is hardly 
loaded while maintaining the functional asymmetry. d, Determination of the 
time constants of short binding in the absence of 5’-phosphate recognition. 
The data in Fig. 4d were accumulated and fitted to the one-step reaction model 
as in Extended Data Fig. 7a. 

presumably by inducing a productive state of Ago2, in a manner dependent on 

Extended Data Figure 10 | Single-molecule properties of the observed 
fluorescent spots. a, Representative time courses of one-step (left) and 
two-step (right) photobleaching for the guide (red) or passenger (green) strand. 
Spots for wild-type Ago2 with the reconstituted system were analysed. 
Arrows show the timings of photobleaching. b, ~95% of the fluorescent spots 
showed one-step photobleaching for each colour, indicating that each spot 
contained a single RNA molecule. c, Photobleaching dependence on power 
density. Left, Alexa647. Right, Alexa555. The photobleaching time constants 
of Alexa647 and 555 under continuous monitoring condition were estimated 
as ~22,800 and ~5,900 s, respectively. Data represent mean ± s.d. (n = 3). 
d, Representative observed spots in snapshot observation without 
deconvolution (raw data). The distribution of fluorescence intensity was fitted 
to Gaussian function (bottom). The diameter of the spots is comparable to the 
diameter of the Airy disk (533 nm), indicating that the spots are diffraction 
limited. Scale bar, 500 nm. e, The signal intensity distributions of observed spots 
and single-molecule benchmarks for Alexa647 (red, left) and Alexa555 (green, 
right) in snapshot observation. Observed, fluorescent spots observed in 
snapshot observation (Fig. 2c; WT, +D/R, +Cpn, yellow). Benchmark, the 
catalytic mutant Ago2 programmed with siRNA duplex, of which the guide and 
passenger strands were labelled with Alexa647 and 555, respectively (1×, 
middle) or with the same dye (2×, bottom). Images of single-molecule 
benchmarks were taken under the same acquisition condition of snapshot 
observation. Background was subtracted from the raw data. f, Representative 
observed spots in continuous monitoring without deconvolution (raw data). 
The distribution of fluorescence intensity was fitted to Gaussian function 
(bottom). The diameter of the spots is comparable to the diameter of the Airy 
disk (533 nm), indicating that the spots are diffraction limited. Scale bar, 
500 nm. g, The signal intensity distributions of observed spots and single- 
molecule benchmarks for Alexa647 (red, left) and Alexa555 (green, right) in 
continuous monitoring. Observed, fluorescent spots observed in continuous 
monitoring (Fig. 3d; WT, +D/R, +Cpn, yellow). Benchmark, the catalytic 
mutant Ago2 programmed with siRNA duplex, of which the guide and 
passenger strands were labelled with Alexa647 and 555, respectively (1×, 
middle) or with the same dye (2×, bottom). Images of single-molecule 
benchmarks were taken under the same acquisition condition of continuous 
monitoring. Background was subtracted from the raw data. 
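
The 533 nm Airy disk diameter quoted in this legend can be reproduced with the standard expression d = 1.22 λ / NA. The sketch below assumes the objective's NA of 1.45 and a wavelength of 633 nm (the He-Ne laser line); the legend does not say which wavelength was used, so that choice is an assumption that happens to match the quoted value.

```python
def airy_disk_diameter_nm(wavelength_nm, numerical_aperture):
    """Airy disk diameter, d = 1.22 * wavelength / NA, in nanometres."""
    return 1.22 * wavelength_nm / numerical_aperture

# Assumed inputs: 633 nm (He-Ne line) and NA 1.45 (objective in Methods).
d = airy_disk_diameter_nm(633, 1.45)   # about 532.6 nm
```

Using an emission wavelength instead would give a somewhat different figure, so this should be read as a consistency check rather than the authors' stated calculation.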

MAD2L2 controls DNA repair at telomeres and DNA 
breaks by inhibiting 5’ end resection 


Vera Boersma, Nathalie Moatti*, Sandra Segura-Bayona, Marieke H. Peuscher, Jaco van der Torre, Brigitte A. Wevers, 
Alexandre Orthwein, Daniel Durocher & Jacqueline J. L. Jacobs 


Appropriate repair of DNA lesions and the inhibition of DNA repair 
activities at telomeres are crucial to prevent genomic instability. By 
fuelling the generation of genetic alterations and by compromising 
cell viability, genomic instability is a driving force in cancer and 
ageing’. Here we identify MAD2L2 (also known as MAD2B or REV7) 
through functional genetic screening as a novel factor controlling DNA 
repair activities at mammalian telomeres. We show that MAD2L2 
accumulates at uncapped telomeres and promotes non-homologous 
end-joining (NHEJ)-mediated fusion of deprotected chromosome 
ends and genomic instability. MAD2L2 depletion causes elongated 
3’ telomeric overhangs, indicating that MAD2L2 inhibits 5’ end 
resection. End resection blocks NHEJ while committing to homology- 
directed repair, and is under the control of 53BP1, RIF1 and PTIP’. 
Consistent with MAD2L2 promoting NHEJ-mediated telomere fu- 
sion by inhibiting 5’ end resection, knockdown of the nucleases CTIP 
or EXO1 partially restores telomere-driven genomic instability in 
MAD2L2-depleted cells. Control of DNA repair by MAD2L2 is not 
limited to telomeres. MAD2L2 also accumulates and inhibits end 
resection at irradiation-induced DNA double-strand breaks and 
promotes end-joining of DNA double-strand breaks in several set- 
tings, including during immunoglobulin class switch recombina- 
tion. These activities of MAD2L2 depend on ATM kinase activity, 
RNF8, RNF168, 53BP1 and RIF1, but not on PTIP, REV1 and REV3, 
the latter two acting with MAD2L2 in translesion synthesis*. Together, 
our data establish MAD2L2 as a crucial contributor to the control 
of DNA repair activity by 53BP1 that promotes NHEJ by inhibiting 
5’ end resection downstream of RIF1. 

As the processes underlying telomere-driven genomic instability are 
not completely understood, we performed a functional genetic screen to 
identify telomere-induced genomic instability regulators (TIGIRs). The 
TIGIR screen relies on well-controlled reversible inactivation of telomere 
component TRF2 by expressing temperature-sensitive TRF2(Ile468Ala) 
(TRF2ts) in Trf2−/− p53−/− (also known as Terf2−/− Trp53−/−) mouse 
embryo fibroblasts (MEFs)°. At the permissive temperature (32 °C), the 
TRF2ts MEFs have intact TRF2-mediated telomere protection, but at 
non-permissive temperatures (37-39 °C) TRF2ts is inactive, causing 
ATM kinase activation, accumulation of DNA damage response (DDR) 
proteins and NHEJ-dependent ligation at chromosome ends’. In the 
TIGIR screen (Fig. 1a), prolonged TRF2-inactivation causes TRF2ts cells 
to irreversibly arrest or die owing to severe chromosome end fusion that 
drives cells into genomic crisis° *. However, diminished telomere fusion, 
such as with RNF8 or DNA ligase IV deficiency, allows survival and 
proliferation despite telomere uncapping”””’. 

In a triplicate TIGIR screen we assayed 1,976 short hairpin RNAs 
(shRNAs), targeting 391 genes linked to DDR, for shRNAs that allow 
TRF2ts cells to survive 12 days of telomere uncapping. Among shRNA 
targets substantially enriched in at least two out of three screens were 
several factors previously shown to control telomere fusion (Extended 
Data Fig. 1a, b), including ATM, NBS1, RAD50, 53BP1 and RNF8 
(refs 11-15). Remarkably, the most prominent screen hit, also indepen- 
dently recovered in a genome-wide TIGIR screen (our unpublished 
results), was MAD2L2 (Extended Data Fig. 1a, b). MAD2L2 has no known 
function at telomeres but acts as a non-catalytic interaction partner of 
REV1 and REV3 in translesion synthesis*. Knockdown of Mad2l2 with 
independent shRNAs markedly increased the survival of TRF2ts MEFs 
subjected to telomere uncapping, which was abolished by complemen- 
tation with exogenous shRNA-resistant human MAD2L2 (Fig. 1b, c 
and Extended Data Fig. 1c-f). Interestingly, we failed to detect a similar 
activity for REV1 or REV3 (Extended Data Fig. 1g-i). 
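
The hit-calling rule used for the triplicate screen (targets enriched in at least two of three screens) can be written down compactly. The sketch below is illustrative only: the enrichment metric, the threshold of 2.0, the function `call_hits` and the placeholder gene names are assumptions, since the text does not specify how enrichment was scored.

```python
def call_hits(enrichment_by_target, threshold, min_screens=2):
    """Return targets whose enrichment meets the threshold in at least
    `min_screens` of the replicate screens, sorted alphabetically."""
    hits = []
    for target, values in enrichment_by_target.items():
        n_enriched = sum(1 for v in values if v >= threshold)
        if n_enriched >= min_screens:
            hits.append(target)
    return sorted(hits)

# Placeholder targets with made-up enrichment values for three screens.
toy = {
    "geneA": [5.0, 4.2, 0.8],   # enriched in two screens
    "geneB": [1.1, 0.9, 3.5],   # enriched in one screen only
    "geneC": [6.3, 7.0, 5.5],   # enriched in all three
}
hits = call_hits(toy, threshold=2.0)
```

Requiring agreement between replicates in this way trades some sensitivity for robustness against shRNAs that are enriched by chance in a single screen.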

Indeed, enhanced survival of MAD2L2-depleted cells after long-term 
TRF2 inactivation was due to diminished telomere fusion. Mad2l2 
knockdown significantly reduced telomere fusions, which was prevented 
by complementation with exogenous MAD2L2 (Fig. 2a and Extended 
Data Fig. 2a). Telomeres end with a single-stranded 3’ G-overhang 
that undergoes DNA-ligase-IV/NHEJ-dependent degradation after 
loss of TRF2-mediated telomere protection’”®. Analysis of telomeric 3’ 
G-overhangs showed that MAD2L2 depletion prevents overhang loss 
at 48 h of TRF2 inactivation, further confirming that MAD2L2 is essen- 
tial for efficient processing of deprotected telomeres by NHEJ (Fig. 2b 
and Extended Data Fig. 2b). The defect in NHEJ-mediated telomere fu- 
sion in Mad2l2-knockdown cells was not explained by changes in cell- 
cycle progression (Fig. 1b, 32 °C panels, and Extended Data Figs 1e and 
2c, d). Moreover, MAD2L2 is essential for sister-telomere fusion after 
activation of DNA repair in mitosis'® (Extended Data Fig. 2e). In line 

Figure 1 | A functional genetic screen identifies MAD2L2 as a critical factor 
in telomere-driven genomic instability. a, Outline of TIGIR screen to identify 
factors controlling telomere-driven genomic instability. d, day. b, Survival 
assays of TRF2ts MEFs infected with control shRNA (sh) or Mad2l2 shRNAs 
(sh1-sh4), stained after growth as indicated. c, Growth curves at 39 °C of 
TRF2ts cells transduced with control or Mad2l2 sh4 shRNAs and 
complemented with empty control or RNA interference (RNAi)-resistant 
Flag-MAD2L2 (data are mean ± s.d. from quadruplicate technical 
replicates). 

*These authors contributed equally to this work. 


28 MAY 2015 | VOL 521 | NATURE | 537 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure 2 graphic, panels a-f. Recoverable labels: Tetraploidy; Aneuploidy; Random integration; EJ5-GFP; Control shRNA and MAD2L2 shRNA; conditions 32 °C and 24 h at 39 °C; time points 0 h, 24 h and 48 h; legend Ctrl sh, 53bp1 sh, Mad2l2 sh2, Mad2l2 sh3, Mad2l2 sh4.] 

Figure 2 | MAD2L2 facilitates telomeric G-overhang degradation, 
NHEJ-mediated telomere fusion, repair and CSR. a, Chromosome fusion in 
shRNA-transduced TRF2ts MEFs (n = 2-4, mean ± s.e.m., ***P = 0.0002, 
****P < 0.0001, Kruskal-Wallis analysis of variance (ANOVA)). b, Telomeric 
single-stranded (ss) G-overhang quantification in TRF2ts MEFs at 32 °C 
and after 48 h at 37 °C or 39 °C (n = 4, except Mad2l2 sh4 at 37 °C: 
n = 1, mean ± s.e.m.). c, Tetraploidy (top) and aneuploidy (bottom) after 
telomere uncapping in TRF2ts MEFs (n = 2, mean ± s.d.). d, NHEJ-mediated 
repair in U2OS cells analysed by random plasmid integration (left, n = 3) or an 
EJ5-GFP reporter containing green fluorescent protein (GFP) (right, n = 6, 
mean ± s.d.). e, MAD2L2 depletion impairs resolution of DDR foci after 
irradiation (2 Gy) in HDR-deficient (RAD51-depleted) U2OS cells, indicating 
defective NHEJ (n = 2, mean ± s.e.m.). f, CSR in shRNA-transduced 
CH12F3-2 cells (n = 3, mean ± s.d.). Ctrl, control. 


with MAD2L2 promoting telomere fusion, chromosomal rearrange- 
ment and mis-segregation® *, MAD2L2 depletion prevented tetraploidy 
and aneuploidy after telomere uncapping (Fig. 2c and Extended Data 
Fig. 2f). 

To address whether the requirement of MAD2L2 for NHEJ extends 
beyond telomeres, we investigated whether MAD2L2 depletion affects 
end-joining in other settings. Knockdown of human MAD2L2 consid- 
erably impaired NHEJ-mediated repair of I-SceI-induced DNA double- 
strand breaks (DSBs), NHEJ-mediated random plasmid integration, 
resolution of irradiation-induced DSBs in homology-directed repair 
(HDR)-deficient cells and clonogenic survival after irradiation, similar 
to depletion of the NHEJ-promoting factors 53BP1 and RIF1 (Fig. 2d, e 
and Extended Data Fig. 3a-e). Moreover, class switch recombination 
(CSR), a physiological process depending on joining of DNA DSBs by 
NHEJ”, was substantially impaired in MAD2L2-depleted cells (Fig. 2f 
and Extended Data Fig. 3f-i). This CSR defect was not due to defects in 
cell proliferation or expression of Aid (also known as Aicda) messenger 
RNA, IgM and IgA germ-line transcripts. Thus, MAD2L2 promotes 
NHEJ-mediated repair in several contexts. 

To investigate how MAD2L2 promotes NHEJ, we first addressed its 
localization. MAD2L2 accumulated both at TRF2-deprotected telomeres 
and irradiation-induced DSBs (Fig. 3a), implying a direct activity of 
MAD2L2 at these sites. Next we examined activation of the DDR. At 3, 
6 and 12 h of TRF2 inactivation, both control and MAD2L2-depleted 
cells activated ATM, phosphorylated H2AX, KAP1 and CHK2 (γH2AX, 
p-KAP1 and p-CHK2, respectively), and formed p-ATM and γH2AX 
foci to a similar extent (Fig. 3b, c and Extended Data Fig. 4a, b). More- 
over, accumulation of 53BP1, RIF1 and PTIP to uncapped telomeres, 
as well as irradiation-induced DSBs, was unperturbed in MAD2L2- 
depleted cells (Fig. 3c and Extended Data Fig. 4a-c). Thus, MAD2L2 is 
not required for recognition of uncapped telomeres or DSBs as damage, 
nor for initiation of DDR signalling. However, probably reflecting aber- 
rant processing of deprotected telomeres, we noticed increased DDR 
signalling at 24 and 48 h of TRF2 inactivation in MAD2L2-depleted cells, 
visible as increased H2AX and KAP1 phosphorylation, and H2AX 
ubiquitylation (Fig. 3b). 

Figure 3 | MAD2L2 inhibits end 

Next we examined the terminal structure of telomeres in more detail. 
Analysis of 3’ single-stranded G-overhangs at 6 and 12 h of TRF2 
inactivation revealed that in the absence of MAD2L2, telomeres gain 
considerably longer 3’ single-stranded overhangs. At 12 h, Mad2l2-knockdown 
cells contained 2.5-fold higher, 3’-exonuclease-sensitive overhang 
signals than control cells without uncapping, and nearly sixfold 
higher overhang signals than control cells at 12 h of telomere uncapping 
(Fig. 3d, e and Extended Data Fig. 5a). Overhang increase upon MAD2L2 
depletion was observed in both wild-type and DNA-ligase-IV-deficient 
TRF2ts cells (Extended Data Fig. 5b-d), indicating it is not a passive 
consequence of lack of NHEJ-mediated overhang degradation, but re- 
presents an active role of MAD2L2 in inhibiting 5’ end resection. 5’ 
DNA end resection is a powerful obstruction to NHEJ because it creates DNA 
ends unsuitable for ligation, explaining why MAD2L2 depletion impairs 
NHEJ-mediated repair. 

5' end resection involves the combined activities of CTIP, MRN, 
EXO1, BLM and DNA2, and is blocked by 53BP1 and its interaction 
partners RIF1 and PTIP. In line with MAD2L2 promoting NHEJ 
by impeding 5’ end resection, co-depletion of CTIP or EXO1 with 
MAD2L2 partially reversed the increased survival of Mad2l2-knockdown 
cells undergoing prolonged telomere uncapping (Fig. 3f and Extended 
Data Fig. 5e-h). Moreover, consistent with the generation of longer 
single-stranded overhangs that block NHEJ and prime for HDR, 
Mad2l2-knockdown cells showed increased phosphorylation of the ATR target 
CHK1 and of the single-stranded DNA-binding protein RPA (Fig. 3g). 
In addition, MAD2L2-depleted cells showed increased HDR at telomeres, 
seen as increased telomere sister-chromatid exchanges (TSCE; Extended 
Data Fig. 6a). 

Because the effects of MAD2L2 depletion on end resection, telomere 
fusion and CSR resemble those of 53BP1, RIF1 and, to a certain extent, 
PTIP inactivation, we asked whether MAD2L2 is epistatic with 
53BP1, RIF1 or PTIP. Similarly, we addressed the requirement of REV1 
and REV3 for the ability of MAD2L2 to control end resection and 
telomere fusion. Unlike in wild-type or PTIP-deficient MEFs, MAD2L2 
depletion did not reduce fusion of TRF2-deprotected telomeres in 
53BP1- or RIF1-deficient MEFs (Fig. 4a and Extended Data Fig. 6b-e). 
However, MAD2L2(Leu186Ala) and MAD2L2(Cys70Arg), defective 
in REV1 or REV3 interaction, respectively, were still capable of promoting 
telomere fusion and telomere-driven genomic instability (Fig. 4b 
and Extended Data Fig. 7a, g, i). Moreover, MAD2L2 depletion resulted 
in increased RPA phosphorylation after irradiation in wild-type, Ptip−/−, 
Rev1−/− or Rev3−/− MEFs, but not in 53bp1−/− (also known as 
Trp53bp1−/−) or Rif1−/− MEFs (Fig. 4c). This indicates that MAD2L2 
controls end resection and fusion of deprotected telomeres in a 53BP1- 
and RIF1-dependent, but PTIP-, REV1- and REV3-independent manner. 
Unexpectedly, MAD2L2-depleted Rif1−/− MEFs showed an increased 
background of chromosome end fusion caused by telomere association 
between sister chromatids (Fig. 4a and Extended Data Fig. 6d, e). The 
molecular nature of these sister telomere associations remains to be 
determined, but they occur irrespective of TRF2 inhibition and potentially 
reflect replication problems. 

To address the dependencies for MAD2L2 function in DNA repair 
further, we investigated the requirements for MAD2L2 accumulation 
to DSBs and NHEJ activity at telomeres. Both DSB accumulation and 
the ability to promote telomere-driven genomic instability by MAD2L2 
require an intact HORMA domain and carboxy terminus, while poten- 
tial S/TQ (Ser or Thr residues that precede Gln) phosphorylation on 
Thr 103 appears dispensable (Fig. 4b and Extended Data Fig. 7). Inter- 
estingly, while MAD2L2 accumulation to irradiation-induced DDR 
foci (IRIFs) was independent of PTIP or interaction of MAD2L2 with 
REV1 or REV3, it clearly depended on ATM kinase activity, RNF8, 
RNF168, 53BP1 and RIF1 (Fig. 4d, e and Extended Data Figs 7c, e and 8). 
More specifically, MAD2L2 IRIF accumulation required the HEAT 
repeats of RIF1 and the same ATM/ATR phosphorylation sites in 53BP1 
that promote RIF1 IRIFs, while those that promote PTIP IRIFs were 
dispensable (Fig. 4f, g and Extended Data Fig. 9a-c). Similarly, 
MAD2L2 localization to uncapped telomeres required 53BP1 (Extended 
Data Fig. 8e). In line with dependency on RIF1, whose localization to 
DSBs is strongly reduced in G2, MAD2L2 foci were also diminished in 
G2, although less than RIF1 foci (Fig. 4h and Extended Data Fig. 9d). 
The latter could relate to a proposed role for MAD2L2 in the late steps 
of HDR. 

In addition to inhibiting end resection, 53BP1, but not RIF1, promotes 
NHEJ by increasing mobility of uncapped telomeres. Consistent 
with MAD2L2 acting via end resection control downstream of RIF1, 
MAD2L2 deficiency did not impair mobility of uncapped telomeres 
(Extended Data Fig. 10a-c). 

Altogether, our data identify MAD2L2 as an important regulator of 
DNA repair pathway choice that promotes NHEJ at telomeres and DSBs 
in multiple settings, by inhibiting CTIP- and EXO1-dependent 5’ end 
resection downstream of RIF1 (Extended Data Fig. 10d). Whether 
MAD2L2 affects regulatory acetylation or phosphorylation of CTIP, 
or blocks other end resection factors, awaits further investigation. 

MAD2L2 and REV3 together compose Polζ and act with REV1 in 
translesion synthesis to facilitate DNA replication past blocking lesions. 
On the basis of reduced gene conversion and impaired RPA foci, RAD51 
foci and DSB resolution in REV1- and Polζ-depleted cells, it has been 
proposed that the ability of REV1/Polζ to alleviate replication stalling 
might also be important during HDR synthesis. Although requiring 
further characterization, together with the data presented here, this 
suggests differential roles for MAD2L2 in DSB repair. As shown here, 
MAD2L2 acts downstream of 53BP1/RIF1 in a REV1/REV3-independent 
manner to promote NHEJ by inhibiting resection, mostly in G1. In 
addition, MAD2L2, as part of Polζ, might promote HDR by facilitating 
replication of damaged DNA formed after resection in G2. 

The contribution of MAD2L2 to DNA repair at uncapped telomeres 
and DSBs suggests that aberrant MAD2L2 expression could have path- 
ological consequences by compromising genome integrity. In this respect 
it is interesting that in particular MAD2L2, more than REV1 or REV3, 
is reported to be overexpressed in several cancers and has prognostic 
significance (ref. 29 and the Oncomine database). Furthermore, our finding 
that MAD2L2 is essential for CSR identifies MAD2L2 as a potential 
disease-susceptibility gene for human primary immunodeficiencies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

TIGIR screen. Trf2−/− p53−/− TRF2ts MEFs (in short TRF2ts MEFs) were infected 
in triplicate with pLKO lentiviral shRNA pools consisting of 1,976 shRNAs selected 
from the Mouse TRC 1.0 Collection. Cells were infected with a multiplicity of infection 
below 1 and drug selected on 4 µg ml−1 puromycin. After completion of selection, 
TRF2ts MEFs were plated in duplicate on 15-cm dishes at 150,000 cells per 
dish and allowed to adhere at 32 °C. One set of plates was kept at 32 °C for 4 addi- 
tional days, the other set of plates was subjected to 12 days of telomere uncapping at 
39 °C, followed by recovery for 4 additional days at 32 °C. Cells were collected for 
genomic DNA isolation using a DNeasy Blood & Tissue Kit (Qiagen) and shRNA 
inserts were recovered from genomic DNA by 2 sequential PCR amplifications 
(PCR1 and PCR2). In PCR1, six individual PCR reactions were performed on 325 ng 
genomic DNA using for each sample a common reverse primer P7_PLKO1_R (5’- 
CAAGCAGAAGACGGCATACGAGATTTCTTTCCCCTGCACTGTACCC-3’) 
and 1 of 6 forward primers with a unique index (underlined): Illuseq_01_PLKO1_f 
(5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGTGATCTTGTGG 
AAAGGACGAAACACCGG-3’), Illuseq_02_PLKO1_f (5’-ACACTCTTTCCCT 
ACACGACGCTCTTCCGATCTACATCGCTTGTGGAAAGGACGAAACACC 
GG-3’), Illuseq_03_PLKO1_f (5’-ACACTCTTTCCCTACACGACGCTCTTCCG 
ATCTGCCTAACTTGTGGAAAGGACGAAACACCGG-3’), Illuseq_04_PLKO1_f 
(5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTTGGTCACTTGTGG 
AAAGGACGAAACACCGG-3’), Illuseq_05_PLKO1_f (5’-ACACTCTTTCCCT 
ACACGACGCTCTTCCGATCTCACTGTCTTGTGGAAAGGACGAAACACC 
GG-3’), Illuseq_06_PLKO1_f (5'-ACACTCTTTCCCTACACGACGCTCTTCC 
GATCTATTGGCCTTGTGGAAAGGACGAAACACCGG-3’). In PCR2, 2.5 µl 
DNA from each PCR1 reaction was amplified using primers P5_Illuseq (5’-AAT 
GATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTT 
CCGATCT-3’) and P7 (5’-CAAGCAGAAGACGGCATACGAGAT-3’). PCR pro- 
ducts were pooled per condition, purified using a Qiagen PCR purification kit 
(Qiagen), analysed on a bioanalyzer for quality and concentration, and subjected to 
deep-sequencing on an Illumina HiSeq 2000 genome analyser at the NKI Genomics 
Core Facility. 
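
The unique index in each PCR1 forward primer was underlined in the original layout, and that underlining does not survive in plain text. The following is a minimal sketch of how the six indices can be recovered from the primer sequences listed above, by stripping the sequence shared at the 5' and 3' ends of all six primers. The function names, and the assumption in `assign_sample` that a sequencing read starts directly with the 6-nucleotide index, are illustrative and are not taken from the original analysis pipeline.

```python
import os

# PCR1 forward primers (5'->3') exactly as listed in the Methods text.
FORWARD_PRIMERS = {
    "Illuseq_01_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGTGATCTTGTGGAAAGGACGAAACACCGG",
    "Illuseq_02_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTACATCGCTTGTGGAAAGGACGAAACACCGG",
    "Illuseq_03_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTGCCTAACTTGTGGAAAGGACGAAACACCGG",
    "Illuseq_04_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTTGGTCACTTGTGGAAAGGACGAAACACCGG",
    "Illuseq_05_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTCACTGTCTTGTGGAAAGGACGAAACACCGG",
    "Illuseq_06_PLKO1_f": "ACACTCTTTCCCTACACGACGCTCTTCCGATCTATTGGCCTTGTGGAAAGGACGAAACACCGG",
}


def common_suffix(seqs):
    """Longest suffix shared by all sequences."""
    reversed_seqs = [s[::-1] for s in seqs]
    return os.path.commonprefix(reversed_seqs)[::-1]


def extract_indices(primers):
    """Return the primer-specific stretch between the shared 5' and 3' parts."""
    seqs = list(primers.values())
    prefix = os.path.commonprefix(seqs)
    suffix = common_suffix(seqs)
    return {
        name: seq[len(prefix):len(seq) - len(suffix)]
        for name, seq in primers.items()
    }


def assign_sample(read, indices):
    """Hypothetical demultiplexing step: match the start of a read to an index.

    Assumes the read begins with the index; returns None when nothing matches.
    """
    for name, index in indices.items():
        if read.startswith(index):
            return name
    return None


INDICES = extract_indices(FORWARD_PRIMERS)
```

Applied to the six primers, this yields one distinct 6-nucleotide index per primer, which is what allows the PCR products to be pooled per condition before sequencing.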

Cell culture, growth assays, flow-cytometry. Trf2−/− p53−/− TRF2ts MEFs (TRF2ts 
MEFs) and Trf2−/− p53−/− Lig4−/− TRF2ts MEFs (TRF2ts Lig4−/− MEFs) were 
generated from Trf2flox/− p53−/− MEFs and Trf2flox/− p53−/− Lig4−/− MEFs as described 
before. All cells were grown in DMEM with 100 U penicillin, 0.1 mg ml−1 
streptomycin, 2 mM L-glutamine and 10% FBS. TRF2ts MEFs were maintained at the 
permissive temperature of 32 °C and only grown at 37 °C or 39 °C to induce telomere 
uncapping through inactivation of TRF2. All other cells were grown at 37 °C. Wild-type, 
53bp1−/−, Rif1−/−, Ptip+/+ and Ptip−/− MEFs were SV40 immortalized. 

For colony survival assays, TRF2ts MEFs were plated at 150,000 cells per 15-cm 
dish or 45,000 per 10-cm dish, allowed to adhere and kept at 32 °C or placed at 
39 °C. To evaluate potential toxicity or growth rate differences, plates were fixed 
with 10% formalin and stained with 0.1% crystal violet after growth for 1 week at 
32 °C. To evaluate survival upon prolonged telomere uncapping plates were fixed 
and stained after growth for 12 days at 39 °C, followed by 0 or 2 weeks recovery at 
32°C. Quantifications of survival assays reflect the relative survival after growth 
for 12 days at 39 °C and 2 weeks recovery at 32 °C, corrected for plating efficiency 
and growth at 32 °C and assessed by crystal violet extraction as described below. 

To address cell survival under telomere uncapping conditions in short-term growth 
assays, TRF2ts MEFs were plated in quadruplicate at 5,000 cells per well on a 12-well 
plate (Fig. 1c) or in duplicate at 2,500 cells per well on a 24-well plate (Extended 
Data Fig. 5h), allowed to adhere overnight at 32 °C and then placed for 12-14 days 
at 39 °C. During this time period of non-permissive temperature, plates were fixed 
every 2-4 days with 10% formalin and stained with 0.1% crystal violet. Crystal 
violet was extracted from the plates using 10% acetic acid and its absorbance at 
595 nm was quantified using a Tecan microplate reader. Quantifications were cor- 
rected for plating efficiency. 
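
The text does not spell out how absorbance readings were corrected for plating efficiency. As one possible reading, the sketch below divides the mean A595 of replicate wells at each time point by the mean A595 of matched wells fixed right after cells had adhered. The function names, the choice of reference wells and all absorbance values are assumptions made for illustration only.

```python
from statistics import mean


def plating_corrected(a595_wells, a595_plating_wells):
    """Mean A595 of replicate wells relative to the post-adherence reference.

    The reference wells (fixed right after adherence) are an assumed way of
    correcting for plating efficiency; the source does not define the method.
    """
    reference = mean(a595_plating_wells)
    if reference <= 0:
        raise ValueError("reference absorbance must be positive")
    return mean(a595_wells) / reference


def growth_curve(timepoints, a595_plating_wells):
    """Map each day at 39 °C to its plating-corrected crystal violet signal."""
    return {
        day: plating_corrected(wells, a595_plating_wells)
        for day, wells in timepoints.items()
    }


# Hypothetical quadruplicate readings (arbitrary absorbance units).
plating = [0.25, 0.25, 0.25, 0.25]
readings = {4: [0.5, 0.5, 0.5, 0.5], 8: [1.0, 1.0, 1.0, 1.0]}
curve = growth_curve(readings, plating)
```

Expressing every time point relative to the same post-adherence reference keeps curves from different shRNA-transduced populations comparable even when slightly different numbers of cells attached.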

Cell cycle distribution of TRF2ts MEFs with or without MAD2L2 knockdown 
was determined by propidium iodide staining, as well as by BrdU incorporation, and 
acquired on a BD Fortessa using FACSDiva software (BD Biosciences) and analysed 
with FlowJo (TreeStar) software. 

Analysis of tetraploidy (8N) and aneuploidy (>4N) was based on DNA-content 
analysis by flow cytometry of TRF2ts cells grown at 32 °C or 39°C for 48 h and 
then stained with propidium iodide. 

Cell cycle synchronisation was done as follows. For G1 cells, RPE cells were kept 
in 2.5 mM thymidine (T1895, Sigma) for 16 h, released for 6 h and treated with 10 µM 
RO3306 (Calbiochem) for 16 h. Synchronised cells were released for 1 h, and mitotic 
cells were collected by mitotic shake off and replated to let them continue into G1 
for 3.5 h before irradiation. 

For G2 cells, RPE cells were kept in 2.5 mM thymidine for 16h, released for 6h 
and treated with 2.5 mM thymidine for 16 h. Synchronised cells were released for 
4.5 h before irradiation to let cells progress to G2. 

Plasmids. Cells were transduced as before with the following pLKO-puro shRNA 
lentiviruses obtained from Mission library clones (Sigma) against mouse genes: 
Mad2l2 sh1 (TRCN0000012843: 5'-CCAGACTCCAAGTGCTCTTAT-3’), Mad2l2 
sh2 (TRCN0000012844: 5’-CCAGTGGAGAAGTTTGTCTTT-3’), Mad2l2 sh3 
(TRCN0000012845: 5’-CGAGCCTTCATCCTTAAGATT-3’), Mad2l2 sh4 
(TRCN0000012846: 5’-CATCTTCCAGAAGCGCAAGAA-3’), Rev1 sh pool (pool of 
TRCN0000120297: 5'-CAGCAGTGCTTGTGAGGTATT-3’, TRCN0000120298: 
5'-GCCGAGATCAACTATGGAATA-3’, TRCN0000120299: 5'-GCTGGAATG 
AAGACGGTGTAA-3’, TRCN0000120300: 5’-GCCCTCCGTATTGAAATCA 
AA-3', TRCN0000120301: 5'-GCACGTTGATATGGACTGCTT-3’), Rev1 sh1 
(TRCN0000120298: 5'-GCCGAGATCAACTATGGAATA-3’), 53bp1 sh1 
(TRCN0000081778: 5'-GCTATTGTGGAGATTGTGTTT-3’) and against human genes: 
MAD2L2 sh (TRCN0000006570: 5’-CCCGGAGCTGAATCAGTATAT-3’), 53BP1 
sh (TRCN0000018865: 5'-GATACTTGGTCTTACTGGTTT-3’), RIF1 sh (TRC 
N0000155431: 5’-CGCATTCTGCTGTTGTTGATT-3’), PTIP sh1 (TRCN00001 
29174: 5'-CTTCAGATTCATCACCGGAAA-3’), PTIP sh2 (TRCN0000322819: 
5'-GCCATGTTCACAGCATATTAT-3’), RNF8 sh (TRCN0000003439: 5'-GA 
AGCCGTTATGAATGTGAAA-3’), RNF168 sh (TRCN0000034137: 5’-GCAGT 
CAGTTAATAGAAGAAA-3’), REV3 sh (TRCN0000244436: 5'-TGACCTGTC 
TGAGACTATTTA-3’), RAD51 sh (TRCN0000018879: 5’-CGGTCAGAGATC 
ATACAGATT-3’). 

For knockdown of Ctip and Exo1, TRF2ts MEFs were transduced as before with 
pRetroSuper-blas shRNA retroviruses containing the following target sequences: 
Ctip sh1: 5'-GCAAGGTTTACAAGTCAAAGT-3’, Ctip sh2: 5’-GCAGACCTT 
TCTCAGTATA-3’, Ctip sh3: 5'-GCATTAACCGGCTACGAAA-3’, Exo1 sh1: 
5'-GCATTTGGCACAAGAATTA-3’. Retroviral shRNAs used for knockdown 
of Rnf8, Lig4 or TRF2 have been described before. The 53bp1 shRNA vector used 
in CSR assays was obtained from Origene (FI334760) and contained the following 
target sequence: 5'-GGAAGTTGAAACCAGTGTGATTAGTATTG-3’. 

For localization and complementation experiments human MAD2L2 comple- 
mentary DNA was Gateway-cloned (Invitrogen) into pMSCV-retroviral vectors 
containing coding sequences for amino-terminal Flag or GFP epitopes. MAD2L2 
deletion constructs were generated through Gateway-cloning (Invitrogen). The 
target sequences of Mad2l2 sh4 or the human MAD2L2 shRNA were changed to 
5'-GATTTTTCAGAAACGCAAGAA-3’ or 5'-TCCCGAACTGAATCAGTATAT-3’, 
respectively, to create RNAi-resistant epitope-tagged MAD2L2 in pMSCV. 
GFP-RIF1, Flag-53BP1-DB and PTIP-GFP expression vectors were described 
before. The target sequence of the human RIF1 shRNA was changed to 5'-GCA 
GCGAAGTTGAAACTTGAA-3’ to create RNAi-resistant GFP-RIF1. 

Quantitative real-time PCR. RNA was isolated using TRIzol reagent (Ambion) 
and reverse transcribed into cDNA using AMV first-strand cDNA synthesis kit for 
RT-PCR (Roche) according to the manufacturer’s instructions. Quantitative real- 
time PCR (qRT-PCR) on cDNA was performed in triplicate using Power SYBR 
Green PCR Master Mix (Applied Biosystems) on the StepOnePlus real-time PCR 
system. 

The following primers were used: for mouse Mad2l2 forward 5'-ACACTCCAC 
TGCGTCAAACC-3’ and reverse 5’-AAAGACAAACTTCTCCACTGGGC-3’; for 
human MAD2L2 forward 5'-CGAGTTCCTGGAGGTGGCTGTGCATC-3’ and 
reverse 5’-CTTGACGCAGTGCAGCGTGTCCTGGATA-3’; for human RNF8 
forward 5'-TGGAAGAGCTAAATCGCAGCA-3’ and reverse 5'-TCAAGGTG 
ACAGCCTCAATGAA-3’; for human RNF168 forward 5'-GGCGAGTTTATGC 
TGTCCCT-3’ and reverse 5'-GCCGCCACCTTGCTTATTTC-3’; for mouse Rev1 
forward 5'-AAGCCAGATGGTCAGTACCACC-3’ and reverse 5'-TCCATTGA 
ACGTCCAACTCCCG-3’; for human REV3 forward 5'-AGGACTCGAAGTCA 
CCTATGC-3’ and reverse 5’-AGAGGTAACCCCAGGAATGC-3’; for mouse Ctip 
forward 5'-GTGCTGGGTCGGAGCAG-3’ and reverse 5'-TTGACTTGTAAAC 
CTTGCACTTCC-3’; for mouse Exo1 forward 5'-TAAACACGTCGAGCCTGT 
CC-3' and reverse 5'-CAGAGCCCAGGAACCTTGTT-3’; for human RAD51 
forward 5'-GTGGAGCTAATGGCAATGCAG-3’ and reverse 5’-TTAGCTCCT 
TCTTTGGCGCA-3’; for mouse Hprt forward 5'-CTGGTGAAAGGACCTCTC 
G-3’ and reverse 5'-TGAAGTACTCATTATAGTCAAGGGCA-3’; for human 
GAPDH forward 5'-GAAGGTGAAGGTCGGAGTC-3’ and reverse 5’-GAAGA 
TGGTGATGGGATTTC-3’. 

Immunoblotting. Whole-cell lysates were prepared by scraping cells in SDS sample 
buffer; protein concentration was determined by standard BCA protein assay 
(Pierce) and equal amounts of protein were separated on precast 4-12% Bis-Tris 
or 3-8% Tris-acetate gels (Invitrogen). Immunoblotting was done according to 
standard methods using IRDye800CW- and IRDye680-labelled secondary anti- 
bodies for detection on the Odyssey Infrared imager (LI-COR) or using horseradish 
peroxidase (HRP)-conjugated secondary antibodies for detection by enhanced 
chemiluminescence (Supersignal, Thermo Scientific). Primary antibodies used were 
against MAD2L2 (sc135977, Santa Cruz, 1:500), phospho-ATM S1981 (4526, Cell 
Signaling, 1:1,000), phospho-KAP1 S824 (Bethyl, 1:1,000), 53BP1 (NB100-305, Novus, 
1:500), phospho-H2AX S139 (clone JBW301, Upstate, 1:1,000), CHK2 (611570, 
BD, 1:1,000), phospho-CHK1 S345 (2348, Cell Signaling, 1:1,000), phospho-ATR 
S428 (Cell Signaling, 1:1,000); RPA34 (RPA34-20, GeneTex, 1:1,000); phospho- 
RPA32 S4/S8 (A300-245A, Bethyl, 1:1,000); CTIP (sc22838, Santa Cruz, 1:500); 
γ-tubulin (GTU-88, Sigma, 1:5,000); β-catenin (610154, BD Biosciences, 1:2,000); 
GFP (A-11122, Invitrogen, 1:500); RIF1 (sc55979, Santa Cruz, 1:1,000); PTIP (ABE69, 
Millipore and ab2614, Abcam, 1:500); actin (CP01, Calbiochem, 1:5,000). Immunoblotting 
for tubulin, β-catenin or actin served as loading controls. 

Immunofluorescence. For immunofluorescence, cells were grown on 8-well chamber 
slides (Lab-Tek) and in cases of MAD2L2 detection first pre-extracted for 30 s 
to 5 min with 0.5% Triton/PBS on ice, then fixed for 10 min in 4% paraformalde- 
hyde in PBS. Cells were permeabilized for 5 min (MEFs) or 10 min (U2OS) in 0.5% 
Triton/PBS, washed twice in PBS, incubated for 1 h in blocking solution (0.02% 
Triton, 5% NGS, 5% FCS in PBS) and overnight at 4 °C with primary antibodies 
in blocking solution. Immunofluorescence for PTIP-GFP in U2OS was done as 
described. Primary antibodies used were against phospho-ATM S1981 (4526, Cell 
Signaling, 1:1,000), phospho-H2AX Ser 139 (05-636, Millipore and 2577S, Cell 
Signaling, 1:500), 53BP1 (A300-272A, Bethyl, 1:2,000), mRIF1 (gift from R. Chapman, 
1:500), RIF1 (A300-569A, Bethyl, 1:1,000), MAD2L2 (sc135977, Santa Cruz, 
1:100), Flag (F3165 or F1804 at 1:500, F7425 at 1:400, Sigma), GFP (A-11122, 
Invitrogen, 1:500). Cells were washed three times with 0.02% Triton/PBS, followed 
by incubation with Alexa Fluor 488 or 568 goat anti-mouse or anti-rabbit IgG sec- 
ondary antibodies (Invitrogen) in blocking solution for 1h. After four washes in 
0.02% Triton/PBS, slides were mounted in Vectashield (Vector Laboratories) con- 
taining DAPI. Confocal fluorescence images were obtained on a Leica SP5 confocal 
system equipped with an Ar, Kr and HeNe laser system. Images were taken with a 
×63 NA 1.32 oil objective and standard LAS-AF software. Possible crosstalk between 
fluorochromes was avoided by careful selection of imaging conditions. A minimum 
of 100 cells per condition per experiment was analysed. DDR foci in MEFs were 
captured using the Metafer4/MetaCyte platform (MetaSystems) equipped with an 
AxioImager Z2 microscope (Carl Zeiss). Images of random selections of cells were 
acquired with an EC ‘Plan-Neofluar’ ×40/0.75 objective. Analysis was done using 
MetaCyte software, carefully fine-tuned for each antibody. A minimum of 500 cells 
was analysed per condition per experiment for p-ATM, γH2AX, 53BP1 and RIF1. 
All assessments of irradiation-induced DDR foci were done 3 h after irradiation 
with 5 Gy, with the exception of Fig. 3a, which is 30 min after 10 Gy. 

Metaphase chromosome analysis. Cell collection, preparation of metaphase spreads 
and telomere FISH with a FITC-OO-(CCCTAA)3 peptide nucleic acid custom probe 
(Biosynthesis) for metaphase chromosome analysis was done as described. 

Digital images of metaphases were captured using the Metafer4/MSearch auto- 
mated metaphase finder system (MetaSystems) equipped with an AxioImager Z2 
microscope (Carl Zeiss). After scanning metaphase preparations at ×10 magnification, 
high-resolution images of metaphases were acquired using a ‘Plan-Apochromat’ 
×63/1.40 oil objective. Chromosome fusions were quantified from 
>30 metaphases. 

For monitoring mitotic telomeres by PNA FISH, IMR90 (E6/E7) cells were 
transduced with pMX-Flag-53BP1 (1-1711) or its mutant Thr1608Ala/Ser1618Ala 
(TASA) and pMX-GFP-RNF8 or its mutant Thr198Ala and selected with 0.2 µg ml−1 
puromycin for 3 days before plating. Cells were transfected with siRNAs targeting 
endogenous RNF8 (ThermoFisher, D-006900-01, target sequence: 5’-AGAAUG 
AGCUCCAAUGUAUUU-3’) and 53BP1 (ThermoFisher, D-003548-01, target 
sequence: 5’-GAGAGCAGAUGAUCCUUUA-3’) and the following siRNAs: PTIP 
siRNA pool (ThermoFisher, D-012795-01-0010, target sequences: 5’-UUAAAG 
CGAGCAAGUAUUA-3’, 5'-GAGCCUGGGUUGAUUAACU-3’, 5'-GUUGA 
UGAGUAUAAGACUA-3’ and 5'-GCCAAUGCAGUGCUGUUUA-3’), RIF1 
(ThermoFisher, D-027983-02, target sequence: 5’-AGACGGUGCUCUAUUGU 
UA-3'), MAD2L2 (ThermoFisher, D-003272-10, target sequence: 5’-GAGAAAUUCGUCUUUG 
AGA-3') and non-targeting control siRNA (ThermoFisher, D-001210-02, target 
sequence: 5'-UAAGGCUAUGAAGAGAUAC-3’) using RNAiMax (Invitrogen) 
and synchronized with a double-thymidine block. Cells were subsequently processed 
as previously described. 

Pulsed-field gel electrophoresis and in-gel detection of telomeric DNA. Analysis 
of 3’ single-stranded G-overhangs at mouse telomeres was performed as described 
by pulsed-field gel electrophoresis and in-gel hybridization of a 32P-labelled telomeric 
repeat (CCCTAA)4 oligonucleotide to native DNA. In brief, Trf2−/− p53−/− 
TRF2ts MEFs were collected at the permissive temperature as well as after growth 
at the non-permissive temperature. Agarose plugs were prepared containing 1 × 10^6 
cells per plug and digested overnight with 1 mg ml−1 proteinase K at 50 °C. Before 
loading on a 1% agarose gel, plugs were digested overnight with 60 U of MboI 
enzyme per plug. After running the gel on a CHEF-DR III pulsed-field gel electrophoresis 
system (BioRad), the gel was dried, pre-hybridized in Church Mix containing 
7% SDS and hybridized overnight with a 32P-labelled (CCCTAA)4 probe, 
all under non-denaturing conditions. Signal was captured on a phosphorimager 
and analysed with AIDA Image Analyzer software. To confirm that the G-overhang 
signal detected with this method was indeed derived from 3’ single-stranded 
telomeric TTAGGG repeats, digestion with 3’ exonuclease and hybridization with 
a 32P-labelled (TTAGGG)4 probe specific for the C-rich strand was performed on 
separate plugs. After capturing the hybridization signal from single-stranded telo- 
meric DNA, the gel was denatured and re-hybridized with the same probes to obtain 
a total telomere signal for the purpose of quantification. 
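
The quantification described above can be sketched as a two-step normalization: the native (single-stranded) signal of each lane is divided by the total telomere signal obtained after denaturation, and the result is expressed relative to a reference lane. The function names, lane labels and signal values below are hypothetical; only the ratio logic follows the description in the text.

```python
def overhang_ratio(ss_signal, total_signal):
    """Single-stranded signal normalized to total telomere signal of one lane."""
    if total_signal <= 0:
        raise ValueError("total telomere signal must be positive")
    return ss_signal / total_signal


def relative_overhang(lanes, reference):
    """Express each lane's normalized overhang signal relative to a reference lane.

    `lanes` maps a lane label to (ss_signal, total_signal) in arbitrary units.
    """
    ref_ratio = overhang_ratio(*lanes[reference])
    return {
        label: overhang_ratio(ss, total) / ref_ratio
        for label, (ss, total) in lanes.items()
    }


# Hypothetical phosphorimager readings (arbitrary units).
lanes = {
    "control, 32 C": (2.0, 8.0),
    "knockdown, 12 h 39 C": (5.0, 8.0),
}
relative = relative_overhang(lanes, reference="control, 32 C")
```

With these illustrative numbers the knockdown lane has a 2.5-fold higher normalized overhang signal than the reference lane; normalizing to the denatured signal corrects for differences in DNA loading between plugs.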

Repair and clonogenic survival assays. To quantify the repair of DSBs by NHEJ, 
U2OS cells transduced with shRNAs were transfected with pEJ5-GFP (NHEJ), as 
well as an mCherry expression vector and pCAGGS-I-SceI-Trex2 (ref. 30). Transfections 
were done using Mirus TransIT LT-1 (Mirus Bio LLC). The fraction of 
mCherry-positive cells that was GFP-positive was determined 72 h after transfec- 
tion by flow-cytometry on a BD LSRFortessa using FACSDiva software. Quantifi- 
cation was carried out with FlowJo software (TreeStar). 

For random plasmid integration assays 500,000 U2OS cells were seeded in 6-cm 
dishes 24 h before transfection with 3.9 µg of NdeI-linearized pMSCVblas-GFP 
and 0.1 µg of mCherry plasmids using ViaFect (Promega). Transfected cells were 
trypsinized the following day and seeded in 10-cm dishes at different densities for 
colony formation. Selection was initiated the following day with blasticidin at 
5 µg ml−1. Cells were fixed with 4% paraformaldehyde 10-14 days after plating 
and stained with 0.1% crystal violet. Colony counting was performed on a GelCount 
(Oxford Optronix) and values were normalized for plating and transfection 
efficiencies. 

For clonogenic survival assays U2OS cells were seeded in duplicate in 6-well plates 
at 100, 200 and 400 cells per well for unirradiated cells, and at 400, 800 and 1,600 
cells per well for irradiated cells. Cells were grown for <20 h before γ-irradiation 
and then fixed after 10-12 days of recovery with 4% paraformaldehyde and stained 
with 0.1% crystal violet. Colonies with >50 cells were counted on a GelCount (Oxford 
Optronix) and the surviving fraction was determined. 
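
The surviving fraction is conventionally the colony yield of irradiated cells divided by the yield expected from the plating efficiency of unirradiated cells. The sketch below applies that standard definition to the seeding densities given above; pooling colonies across the three densities and all colony counts are assumptions for illustration, since the text does not state how wells were combined.

```python
def plating_efficiency(colonies, seeded):
    """Colonies formed per cell seeded, pooled over unirradiated wells."""
    return sum(colonies) / sum(seeded)


def surviving_fraction(colonies_irr, seeded_irr, efficiency):
    """Colonies from irradiated wells relative to the expected yield."""
    if efficiency <= 0:
        raise ValueError("plating efficiency must be positive")
    return sum(colonies_irr) / (sum(seeded_irr) * efficiency)


# Seeding densities from the Methods; colony counts are hypothetical.
seeded_unirradiated = [100, 200, 400]
colonies_unirradiated = [50, 100, 200]
seeded_irradiated = [400, 800, 1600]
colonies_irradiated = [20, 40, 80]

pe = plating_efficiency(colonies_unirradiated, seeded_unirradiated)
sf = surviving_fraction(colonies_irradiated, seeded_irradiated, pe)
```

Seeding irradiated cells at higher densities, as done here, keeps the number of countable colonies per well in a similar range for both conditions.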

CSR assays. Research involving animals was performed in accordance with protocols 
approved by the animal facility at Toronto Centre for Phenogenomics (Toronto). 
Primary B cell extraction was performed as described before. In brief, mature B 
lymphocytes were isolated from spleens of 8-12-week-old C57BL/6 mice by depletion 
of CD43+ cells using CD43 microbeads (Miltenyi Biotec) according to the 
manufacturer’s instructions. Purified B cells were resuspended at a concentration 
of 10^6 cells per ml in the presence of 25 µg ml−1 LPS (Sigma-Aldrich) to allow B cell 
proliferation. Both mouse CH12F3-2 (ref. 31) and primary B cells were infected 
with lentiviral/retroviral particles obtained from the supernatant of HEK293T cell 
cultures cotransfected with the lentiviral/retroviral shRNA constructs and the vector 
expressing MLV Gag-Pol (for retrovirus) or the lentiviral psPAX2 packaging vector 
with VSV-G envelope (2:1:1 ratio). In brief, the viral supernatant was concentrated 
at 25,000g for 90 min at 25 °C before resuspension in RPMI culture medium. Infection 
was subsequently performed as described and followed by selection with 
puromycin. 

IgM to IgA switching was assayed in CH12F3-2 cells activated with 1 ng ml−1 
TGF-β1 (PeproTech), 10 ng ml−1 IL-4 (PeproTech) and 1 µg ml−1 agonist anti-CD40 
(BD). IgA expression was measured by flow-cytometry using biotin-conjugated 
anti-mouse IgA antibody (1:200; eBioscience) followed by an anti-biotin-APC 
antibody at 0, 24 and 48 h after activation. Proliferation of the different transduced 
CH12F3-2 cell lines was monitored using CFSE (Invitrogen) following the manu- 
facturer’s guidelines. CSR assays in both cell types were done in triplicate for every 
independent experiment. 

Ex-vivo isotype switching to IgG1 in primary B cells was assayed by flow- 
cytometry using an anti-IgG1-biotin (1:200; BD) followed by allophycocyanin 
(APC)-conjugated anti-biotin (Miltenyi Biotec) and 10 µg ml−1 propidium iodide 
5 days after isolation. 

For semiquantitative RT-PCR analysis of Aid mRNA, IgM and IgA germ-line 
transcripts total RNA was extracted with TRIzol (Invitrogen) and cDNA was syn- 
thesized by random priming using the Superscript III reverse transcriptase kit 
(Invitrogen). Mouse Aicda transcripts were amplified using oligonucleotides 5’-G 
TGCCACCTCCTGCTCACTGG-3’ and 5'-TTCATGTAGCCCTTCCCAGGC-3’. 
IgM (Iµ) germ-line transcripts were amplified using oligonucleotides 5'-AAAATG 
TCCGCTGGGCTAAG-3’ and 5’-AGAACAGTCCAGTGTAGGCAGTAGA-3’, 
IgA (Iα) using 5’-CCTGGCTGTTCCCCTATGAA-3’ and 5’-GAGCTGGTGGG 
AGTGTCAGTG-3’; and Gapdh transcripts were amplified using oligonucleotides 
5'-AGCCTCGTCCCGTAGACAA-3’ and 5’-AATCTCCACTTTGCCACTGC-3’. 
53BP1 depletion is known to impair CSR and was included as control. 

TSCE. TSCE were detected by chromosome-orientation FISH, essentially as 
described with minor modifications. In brief, for chromosome-orientation FISH 
cells were grown in 10 µM BrdU:BrdC (3:1) for 24 h with the addition of 0.2 µg ml−1 
demecolcine for the final 2 h. Slides were treated with RNase A (0.5 mg ml−1) for 
10 min at 37 °C, stained with Hoechst 33258 (0.5 µg ml−1) in 2× SSC for 15 min 
at room temperature and exposed to 5.4 × 10^3 J m−2 365-nm ultraviolet light 
(Stratalinker 2400 UV irradiator). Following digestion with exonuclease III 
(10 U µl−1, Promega) for 10 min at room temperature, slides were dehydrated 
through an ethanol series (70%, 95% and 100%) and incubated sequentially with 
Cy3-TelG 5’-[TTAGGG]3-3’ and FITC-TelC 5’-[CCCTAA]3-3’ probes at room 
temperature. 

Telomere mobility. Telomere mobility analysis was done as described with minor 
adjustments. TRF2ts cells expressing eGFP-TRF1 and mCherry-BP1-2 (53BP1 
amino acids 1220-1711) were seeded on Willco wells and left to attach overnight 
at 32°C. Before imaging, cells were incubated at 37 °C for 30 min in Leibovitz’s 
L15 medium inside the Microscope climate chamber. Cells were monitored every 
minute for 20 min (t = 20 frames) at 37 °C. Five-micrometre Z-stacks at 0.5-µm 
steps in both eGFP and mCherry channels were obtained with a ×63 1.4 oil objective 
using ZEN software on an Observer Z1 microscope system (Carl Zeiss). Images 
were obtained at 2 × 2 binning with 512 × 512 pixels. Tracking of telomeres was 
performed with FIJI software; cells were registered with the Rigid body option from 
the StackReg plugin. Particles were then tracked with the particle detector and 
tracker plugin from Mosaic. Telomeres to track were selected on basis of colocalization 
with mCherry-BP1-2, and tracked for at least 17 out of 20 frames. 

Database. To address whether changes in MAD2L2 mRNA expression occur in 
human cancers we consulted the Oncomine database at 
https://www.oncomine.org/resource/login.html. 

Statistics. No statistical methods were used to predetermine sample size. P values 
were determined by Kruskal-Wallis ANOVA test. 


30. Bennardo, N., Gunn, A., Cheng, A., Hasty, P. & Stark, J. M. Limiting the 
persistence of a chromosome break diminishes its mutagenic 
potential. PLoS Genet. 5, e1000683 (2009). 

31. Kato, H. et al. Involvement of RBP-J in biological functions of mouse 
Notch1 and its derivatives. Development 124, 4133-4141 (1997). 

32. Celli, G. B., Denchi, E. L. & de Lange, T. Ku70 stimulates fusion 
of dysfunctional telomeres yet protects chromosome ends 
from homologous recombination. Nature Cell Biol. 8, 885-890 
(2006). 


©2015 Macmillan Publishers Limited. All rights reserved 



Extended Data Figure 1 | Related to Fig. 1. a, A functional genetic screen for 
TIGIRs identifies independent shRNAs against Mad2l2 and the previously 
identified regulators of NHEJ-mediated telomere fusion Atm, Nbs1 (also 
known as Nbn), Rad50, 53bp1 and Rnf8. Listed are the independent shRNAs 
enriched >1.5-fold in at least two out of three TIGIR screens, with their average 
ratio of enrichment over all three screens. Ratios reflect shRNA abundance 
after 12 days of telomere uncapping at 39 °C followed by 4 days recovery at 
32 °C versus shRNA abundance after growth for 4 days at 32 °C. b, Additional 
shRNAs targeting Mad2l2 or known TIGIRs that were enriched >1.5-fold in 
one out of three screens, with their ratio. Not shown are additional shRNAs 
against factors not previously implicated in control of telomere fusion, that 
were considerably enriched in these TIGIR screens but await validation. 

c, RT-PCR analysis of Mad2I2 expression levels in TRF2ts MEFs transduced 


| shRNA____| Ratio A | Ratio B | Ratio C | 
[TRCN0000120297:Revi__ | 0.02 | 0.10 | 0.16 _| 
[TRCN0000120299:Revi__| 0.05 | 0.19 | 0.18 | 
[TRCNO000120300:Rev1__| 0.02 | 0.40 | 1.41 | 
[TRCN0000120301:Revi__| 0.02 | 0.05 | 0.34 | 
[TRCNO000119967:Rev3l__| 0.02 | 0.72 | 0.38 _| 
[TRCN0000119968:Rev3I__ | 0.12 | 0.15 | 0.36_| 
[TRCNO000119970:Rev3il__ | 0.31 | 0.19 | 0.32 | 
[TRCNO000119971:Rev3l_ | 0.04 | 0.15 | 0.27 | 
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with four independent shRNAs targeting Mad2/2 and used in Fig. 1b (error bars 
denote s.d.). d, Survival assay of TRF2ts cells infected with control or Mad2I2 
sh4 shRNAs, complemented with empty control or RNAi-resistant Flag- 
MAD212®* and grown as indicated. e, Western blot showing expression of 
endogenous MAD212 and exogenous Flag-MAD212 in TRF2ts cells used in d 
and in Fig. Ic. f, Photograph of shRNA-transduced TRF2ts cells grown for 
12 days at 39 °C. Scale bar, 100 Lm. g, None out of ten shRNAs targeting Rev! or 
Rev3 were substantially enriched in any of three independent DDR TIGIR 
screens. h, shRNA-mediated Rev1 knockdown does not increase survival after 
prolonged telomere uncapping in survival assays of TRF2ts MEFs. Of note, 
Rev3 knockdown compromised viability and was therefore not informative in 
these assays. i, RT-PCR analysis of mouse Rev1 expression levels of cells 
shown in h (error bars denote s.d.). 
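
The enrichment criterion described in a and b (ratio >1.5 in at least two, or in one, of three screens) can be sketched as a simple filter. This is an illustrative sketch only, not the authors' analysis pipeline: the function name `enriched` and the dictionary layout are assumptions, and the input values are the Rev1 and Rev3l ratios from the table above.

```python
from statistics import mean

# Enrichment ratios (screen A, B, C) for the Rev1 and Rev3l hairpins
# listed in the Extended Data Figure 1 table.
RATIOS = {
    "TRCN0000120297:Rev1": (0.02, 0.10, 0.16),
    "TRCN0000120299:Rev1": (0.05, 0.19, 0.18),
    "TRCN0000120300:Rev1": (0.02, 0.40, 1.41),
    "TRCN0000120301:Rev1": (0.02, 0.05, 0.34),
    "TRCN0000119967:Rev3l": (0.02, 0.72, 0.38),
    "TRCN0000119968:Rev3l": (0.12, 0.15, 0.36),
    "TRCN0000119970:Rev3l": (0.31, 0.19, 0.32),
    "TRCN0000119971:Rev3l": (0.04, 0.15, 0.27),
}


def enriched(ratios, threshold=1.5, min_screens=2):
    """Return {shRNA: average ratio} for hairpins whose ratio exceeds
    `threshold` in at least `min_screens` screens (hypothetical helper)."""
    hits = {}
    for name, values in ratios.items():
        n_above = sum(v > threshold for v in values)
        if n_above >= min_screens:
            hits[name] = mean(values)
    return hits


# Criterion of panel a (two of three screens) and panel b (one of three).
print(enriched(RATIOS))
print(enriched(RATIOS, min_screens=1))
```

Neither call returns any hairpin for these values, which agrees with the statement in g that no Rev1 or Rev3 shRNA was substantially enriched.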
Extended Data Figure 2 | Related to Fig. 2. a, Representative metaphase
spreads of TRF2ts MEFs transduced as indicated, collected after 24 h at 39 °C
for telomere FISH. Original magnifications, ×63. b, Representative telomeric
single-stranded G-overhang (ss TTAGGG) and total telomere (total
TTAGGG) analysis in TRF2ts MEFs at 32 °C and after 48 h at 37 °C or 39 °C.
c, d, Mad2l2 knockdown does not affect cell cycle parameters in TRF2ts MEFs.
Cell cycle phase analysis was based on propidium iodide staining of
asynchronously growing cells (c), as well as on 1 h incubation with BrdU,
followed by detection of BrdU incorporation and propidium iodide staining
for DNA content (d) (n = 3, mean ± s.d.). e, MAD2L2 is required for
sister-telomere fusion upon activation of DNA repair in mitosis.
Sister-telomere fusions were quantified in IMR90 cells expressing exogenous
wild-type 53BP1 and RNF8, or 53BP1(Thr1608Ala/Ser1618Ala) (TASA) and
RNF8(Thr198Ala) (TA) mutant alleles, and depleted for endogenous RNF8
and 53BP1, as well as depleted for MAD2L2, RIF1 or PTIP (n = 4). f, Examples
of DNA content profiles of TRF2ts cells transduced with the indicated shRNAs,
grown at 32 °C, or for 48 h at 39 °C and stained with propidium iodide.
Analysis of the fraction of cells with 8N (tetraploid) or >4N (aneuploid)
DNA content was done on corresponding dotplots of which the results are
shown in Fig. 2c.
Extended Data Figure 3 | Related to Fig. 2. a, RT-PCR analysis of MAD2L2
expression levels in U2OS cells infected with control or MAD2L2 shRNAs,
and used in the repair assays shown in Fig. 2d (error bars denote s.d.).
b, c, RT-PCR analysis of RAD51 (b) and MAD2L2 (c) expression levels in
RAD51-depleted, E6E7-expressing U2OS cells used in the assays shown in Fig. 2e
(error bars denote s.d.). d, Clonogenic survival assays of U2OS cells transduced
with non-targeting control or 53BP1, RIF1 or MAD2L2 shRNAs and treated
with the indicated doses of ionizing radiation (n = 3-4, mean ± s.e.m.).
e, Western blot analysis of 53BP1, MAD2L2 and RIF1 in U2OS cells transduced
with the indicated shRNAs. f, CSR in shRNA-transduced primary B cells
(n = 2, mean ± s.d.). g, Western blot analysis of MAD2L2 and 53BP1 in
CH12F3-2 B cells and mouse primary B cells transduced with the indicated
shRNAs. h, MAD2L2 depletion does not affect cellular proliferation in murine
B cells. CH12F3-2 cells transduced with control, 53bp1 or Mad2l2 shRNAs were
loaded with CFSE and analysed by flow cytometry at 0, 24 and 48 h after
stimulation. Profiles from all time points are plotted in the same histogram.
i, MAD2L2 depletion does not affect the transcription of critical genes
implicated in CSR. RT-PCR analysis of Aid (Aicda) mRNA, IgM and IgA
germline transcript (GLT) levels using twofold serial dilutions of cDNA made
from activated CH12F3-2 B cells transduced with the indicated shRNAs. Gapdh
was used as a control for transcript expression.
Extended Data Figure 4 | Related to Fig. 3. a, Representative images of
immunofluorescence detection of p-ATM(S1981), γH2AX, 53BP1 and RIF1 in
TRF2ts MEFs transduced with control or Mad2l2 shRNAs and grown at
32 °C or for 12 h at 39 °C to induce telomere uncapping. DNA was stained with
4',6-diamidino-2-phenylindole (DAPI). Original magnifications, ×40.
b, Quantification of the number of p-ATM, γH2AX, 53BP1 and RIF1 foci per
cell in TRF2ts MEFs transduced with control or Mad2l2 shRNAs and grown at
32 °C or for 3 and 12 h at 39 °C (n = 2, mean ± s.d.). c, Quantifications of
53BP1, RIF1 and PTIP foci in U2OS cells transduced with control or MAD2L2
shRNAs, 3 h after 5 Gy (n = 2, mean ± s.d.).
Extended Data Figure 5 | Related to Fig. 3. a, Telomeric single-stranded
G-overhang assay of TRF2ts MEFs transduced with control or Mad2l2 sh4
shRNAs, showing that the increase in overhang signal upon Mad2l2
knockdown is due to 3' terminal sequences because the signal is removed by
treatment with Escherichia coli 3' exonuclease ExoI. b, Mad2l2 knockdown
causes increased single-stranded telomeric G-overhang signals in TRF2ts
Lig4-/- MEFs. c, Quantification of relative telomeric G-overhang signals in
TRF2ts Lig4-/- MEFs transduced with control or Mad2l2 shRNAs and grown
at 32 °C or for 12 or 24 h at 39 °C (n = 2, mean ± s.e.m.). d, RT-PCR analysis
of Mad2l2 expression levels in TRF2ts Lig4-/- MEFs infected with control or
Mad2l2 shRNA lentivirus (error bars denote s.d.). e, Survival assays of TRF2ts
MEFs transduced with control or Mad2l2 shRNAs and subsequently with
control, Ctip or Exo1 shRNAs. f, Quantification of the survival assays shown
in e. g, RT-PCR analysis of Ctip and Exo1 expression levels of cells shown in
e, f and h and in Fig. 3f (error bars denote s.d.). h, Growth curves at 39 °C of
TRF2ts MEFs transduced with non-targeting control or Mad2l2 shRNAs
and subsequently with control, Ctip or Exo1 shRNAs (error bars denote s.e.m.).
Extended Data Figure 6 | Related to Figs 3 and 4. a, TSCE analysis in
shRNA-transduced TRF2ts MEFs grown at 32 °C or for 12 h at 39 °C to
uncap telomeres (n = 2, mean ± s.d., counting >1,000 chromosomes per
condition, per experiment). TSCE frequency in control cells is set at 1
(corresponding to an average of 6.9% of chromosomes with a TSCE event).
Shown on the right are examples of chromosomes without and with TSCE in
cells quantified on the left. Original magnifications, ×63. b, qRT-PCR analysis
of Mad2l2 expression levels in 53bp1-/-, Rif1-/-, Ptip-/- and Ptip+/+
MEFs used in the chromosome fusion analysis shown in Fig. 4a and in c
(error bars denote s.d.). c, Percentage of chromosomes fused upon TRF2
inhibition in the Ptip+/+ MEFs matching with the Ptip-/- MEFs shown in
Fig. 4a (n = 2, mean ± s.e.m.). d, Analysis of different types of telomere fusions
in Rif1-/- MEFs. Depletion of MAD2L2 in Rif1-/- MEFs does not reduce
inter-chromosomal telomere fusions induced by TRF2 inhibition, indicating
epistasis. However, irrespective of TRF2 inhibition, MAD2L2 depletion in
Rif1-/- MEFs induces association between sister telomeres, causing an increase
in total fusions scored for MAD2L2-depleted Rif1-/- MEFs, as also visible in
Fig. 4a (n = 2, mean ± s.e.m., >1,300-2,000 chromosomes were analysed
per condition, per independent experiment). e, Explanation of scoring different
types of telomere fusions shown in d.
Extended Data Figure 7 | Related to Fig. 4. a, Schematic overview of
C-terminal and N-terminal deletion mutants of MAD2L2. b, c, Expression
analysis in U2OS cells of GFP-tagged wild-type MAD2L2 and C- and
N-terminal MAD2L2 deletion mutants (b) and of Flag-tagged wild-type
MAD2L2 and MAD2L2(Cys70Arg) and MAD2L2(Leu186Ala) (c). d, Analysis
of DDR foci formation of GFP-tagged wild-type MAD2L2 and C- and
N-terminal MAD2L2 deletion mutants by immunofluorescence detection of
GFP and γH2AX in U2OS at 3 h after ionizing radiation (n = 2 for N-terminals,
n = 3 for C-terminals, mean ± s.e.m.). e, Analysis of wild-type, Cys70Arg and
Leu186Ala MAD2L2 accumulation into DDR foci by immunofluorescence
detection of MAD2L2 and 53BP1 in U2OS at 3 h after ionizing radiation
(n = 2, mean ± s.e.m.). f, g, Expression analysis in TRF2ts MEFs of GFP-tagged
MAD2L2 and C- and N-terminal MAD2L2 deletion mutants (f) and of
Flag-tagged wild-type MAD2L2 and MAD2L2(Cys70Arg), MAD2L2(Leu186Ala),
MAD2L2(Thr103Ala) and MAD2L2(Thr103Asp) (g). h, Quantification of
chromosome fusions after 24 h of telomere deprotection at 39 °C in TRF2ts
MEFs transduced with control or Mad2l2 shRNAs and complemented with
empty vector control or RNAi-resistant GFP-tagged wild-type MAD2L2 and
C- or N-terminal MAD2L2 deletion mutants (error bars denote s.e.m.).
i, Quantification of survival assays of TRF2ts MEFs transduced with control
or Mad2l2 shRNAs and subsequently with empty vector control,
wild-type MAD2L2, MAD2L2(Cys70Arg), MAD2L2(Leu186Ala),
MAD2L2(Thr103Ala) or MAD2L2(Thr103Asp) retroviruses (n = 2,
mean ± s.e.m.).
Extended Data Figure 8 | Related to Fig. 4. a, Representative images of
immunofluorescence detection of endogenous MAD2L2 and γH2AX or 53BP1
in U2OS cells transduced with control, 53BP1, RNF8, RNF168, RIF1, PTIP
or REV3 shRNA lentiviruses, irradiated with 5 Gy and processed for
immunofluorescence after 3 h (quantifications are shown in Fig. 4e).
b, Western blot or qRT-PCR analysis of PTIP, RNF8, RNF168 and REV3 levels
in shRNA-transduced U2OS cells (error bars denote s.d.). c, Quantification
and representative images of immunofluorescence detection of γH2AX
and GFP-MAD2L2 in Ptip+/+ or Ptip-/- MEFs, 3 h after 5 Gy (n = 2,
mean ± s.d.). d, Representative images of immunofluorescence for 53BP1
and exogenous Flag-MAD2L2 wild-type, Cys70Arg or Leu186Ala mutants
in U2OS cells depleted for endogenous MAD2L2 with lentiviral shRNA,
processed for immunofluorescence 3 h after 5 Gy (quantifications are
shown in Extended Data Fig. 7e). e, Quantification and representative
immunofluorescence images of GFP-MAD2L2 localization to uncapped
telomeres in TRF2ts MEFs transduced with control or 53bp1 shRNAs (n = 2,
mean ± s.d.). Original magnifications, ×63 with zoom factor 5.
Extended Data Figure 9 | Related to Fig. 4. a, Schematic representation of
the 53BP1 alleles with wild-type and substituted S/TQ sites used to address
MAD2L2 IRIF localization dependence on 53BP1. Representative
immunofluorescence images are displayed that show colocalization of
endogenous MAD2L2 with 53BP1-DB-WT and -8A, but not with 53BP1-DB-7A
or -28A. Quantifications are provided in Fig. 4f. b, Top, representative
immunofluorescence images showing colocalization of endogenous RIF1 with
53BP1-DB-WT and -8A, but not 53BP1-DB-7A or -28A. Quantifications are
presented in Fig. 4f. Bottom, representative immunofluorescence images
showing colocalization of PTIP-GFP with 53BP1-DB-WT and -7A, but
impaired colocalization of PTIP-GFP with 53BP1-DB-8A or -28A.
c, Schematic representation of GFP-tagged wild-type RIF1 and GFP-tagged
RIF1 lacking the N-terminal HEAT repeats (ΔHEAT), used to address
MAD2L2 IRIF localization dependence on the HEAT repeats of RIF1.
Representative immunofluorescence images are shown. Quantifications are
presented in Fig. 4g. d, Cell cycle phase distributions of the RPE cells used in
Fig. 4h to address cell cycle dependence of endogenous RIF1 and MAD2L2
localization to IRIFs (n = 3, mean ± s.d.). Original magnifications, ×63 with
zoom factor 5.

[Panel d labels: S, 2.3% ± 1.1; G2, 82.5% ± 3.3]
Extended Data Figure 10 | Related to Fig. 4. a, Frequency distribution (left)
and scatter plot (right) of total distances travelled by uncapped telomeres in
control or Mad2l2 knockdown cells. b, RT-PCR analysis of Mad2l2
expression levels and western blot analysis of 53BP1 protein levels in TRF2ts
cells used in the experiments shown in a and c (error bars denote s.d.).
c, Distance travelled by 10 representative uncapped telomeres for each
condition. While multiple uncapped telomeres in 53BP1-depleted cells have
reduced mobility, this is not seen for uncapped telomeres in MAD2L2-depleted
cells. d, Model of the role of MAD2L2 in promoting NHEJ.
REV7 counteracts DNA double-strand break
resection and affects PARP inhibition

Error-free repair of DNA double-strand breaks (DSBs) is achieved
by homologous recombination (HR), and BRCA1 is an important
factor for this repair pathway. In the absence of BRCA1-mediated
HR, the administration of PARP inhibitors induces synthetic
lethality of tumour cells of patients with breast or ovarian cancers.
Despite the benefit of this tailored therapy, drug resistance can
occur by HR restoration. Genetic reversion of BRCA1-inactivating
mutations can be the underlying mechanism of drug resistance, but
this does not explain resistance in all cases. In particular, little is
known about BRCA1-independent restoration of HR. Here we show
that loss of REV7 (also known as MAD2L2) in mouse and human cell
lines re-establishes CTIP-dependent end resection of DSBs in
BRCA1-deficient cells, leading to HR restoration and PARP inhibitor
resistance, which is reversed by ATM kinase inhibition. REV7 is
recruited to DSBs in a manner dependent on the
H2AX-MDC1-RNF8-RNF168-53BP1 chromatin pathway, and seems to block
HR and promote end joining in addition to its regulatory role in
DNA damage tolerance. Finally, we establish that REV7 blocks
DSB resection to promote non-homologous end-joining during
immunoglobulin class switch recombination. Our results reveal an
unexpected crucial function of REV7 downstream of 53BP1 in
coordinating pathological DSB repair pathway choices in
BRCA1-deficient cells.

To identify mechanisms of BRCA1-independent restoration of the HR
pathway, we carried out a loss-of-function short hairpin RNA (shRNA)
screen using the KB1P-B11 and KB1P-G3 cell lines that we previously
derived from Brca1-/- p53-/- (p53 is also known as Trp53) mouse
mammary tumours (Fig. 1a and Supplementary Table 1). Resistant
cells were selected with a high concentration of olaparib (500 nM, about
100-fold the half-maximal inhibitory concentration (IC50)), which killed
cells of the empty vector control. Sequencing of the olaparib-surviving
colonies revealed a reproducible enrichment of various individual
hairpins targeting Rev7 or 53bp1 (also known as Trp53bp1). To validate
the Rev7 hit, we introduced two different hairpins into the KB1P-B11
and KB1P-G3 cell lines; these substantially inhibited Rev7 expression
(Fig. 1b, c and Extended Data Fig. 1a). Despite the role of REV7 in
metaphase-to-anaphase transition, the level of Rev7 inhibition in these
cells did not affect proliferation (Extended Data Fig. 1b, c), allowing
long-term clonogenic survival assays. We confirmed that loss of Rev7
resulted in increased resistance to the PARP inhibitors (PARPi)
olaparib and AZD2461 (ref. 7) in both cell lines (Fig. 1d and Extended Data
Fig. 1d-g). Resistant cells that survived olaparib treatment (Rev7
sh1/2-ola) yielded even lower REV7 expression levels and increased numbers
of colonies after PARPi treatment (Fig. 1b-d and Extended Data Fig. 1h).
When we reconstituted the Rev7-depleted cells with shRNA-resistant
Rev7 complementary DNA resulting in similar REV7 protein levels
(Extended Data Fig. 1i), we successfully re-sensitized the tumour cells
to PARPi (Fig. 1e, f).

Tumours derived from the Brca1-/- p53-/- cells with stable Rev7
inhibition also showed olaparib resistance in vivo, in contrast to the
empty vector controls (Fig. 1g and Extended Data Fig. 1j-l). In addition,
we found that Rev7 loss explains some cases of in vivo acquired PARPi
resistance in BRCA1-deficient mouse mammary tumours (data not
shown). REV7 depletion also resulted in PARPi resistance of the human
BRCA1-deficient cell line SUM149PT (Extended Data Fig. 2).
Together, these data strongly indicate that inhibition of Rev7 confers
PARPi resistance in BRCA1-deficient tumour cells.

Together with the catalytic subunit REV3, REV7 forms the translesion
synthesis polymerase ζ (Polζ), and it interacts with REV1 (ref. 9).
We therefore investigated whether loss of REV1 or REV3 also confers
PARPi resistance in Brca1-/- p53-/- cells. A 60% inhibition of Rev1
or Rev3 transcripts did not cause olaparib resistance (Extended Data
Fig. 3a-d). Moreover, we studied various shRNA-resistant REV7
mutants that lack REV1 (Leu186Ala/Gln200Ala/Tyr202Ala and a
1-183-amino-acid truncated protein) or REV3 (Cys70Arg) binding sites.
In contrast to the truncated 1-140-amino-acid REV7 protein, these
mutants are recruited to DNA damage sites (Extended Data Fig. 3e-g),
and their expression in Rev7 shRNA KB1P-B11 and KB1P-G3 cells
significantly restored the sensitivity to PARPi to a degree approaching
that of wild-type REV7 (Fig. 2a, b; P < 0.001, t-test). The remaining
differences of the Leu186Ala/Gln200Ala/Tyr202Ala or Cys70Arg
mutants with wild-type REV7 may be explained by unequal expression levels
(Extended Data Fig. 3f). These data suggest that the REV1 or REV3
interaction is not absolutely required for the REV7-mediated function in
this context.

We observed that green fluorescent protein (GFP)-tagged REV7
co-localizes with 53BP1 shortly after DNA damage induction, suggesting
that REV7 acts directly at the site of DNA damage (Fig. 2c). REV7
recruitment depends on H2AX, MDC1, RNF8, RNF168 and partly ATM,
in both mouse and human cells (Fig. 2d, e and Extended Data Fig. 4a-d).
To examine whether PARPi resistance in Rev7-depleted Brca1-/- p53-/-
tumour cells is due to HR restoration, we investigated RAD51 focus
Figure 1 | Identification of loss of Rev7 in PARPi-resistant Brca1-/-
p53-/- mammary tumour cells. a, Design of the functional shRNA screen.
gDNA, genomic DNA. b, c, Quantification of Rev7 transcript (b) or protein
(c) levels in KB1P-G3 cells transduced with Rev7-targeting shRNAs or the
vector control. Hprt was used as a control for transcript expression, and
β-tubulin was used as a control for protein expression. The data represent the
mean ± s.d. d, e, Long-term clonogenic assay using KB1P-G3 cells transduced
with the indicated constructs (wt Rev7 stands for pLenti6-wt Rev7) and
treatments. f, Quantification of the clonogenic assay in e by determining the
absorbance of crystal violet at 590 nm. All the groups were normalized to
the absorbance of the vector control. The data represent the mean ± s.d.
g, Overall survival of mice with KB1P-G3-derived Rev7-depleted or control
tumours treated with one regimen of 50 mg olaparib per kilogram daily for
28 days or left untreated. The P value was calculated using the log-rank test.
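
The normalization described in f (crystal violet absorbance at 590 nm, each group expressed relative to the vector control, reported as mean ± s.d.) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' script: the function name, the group labels and all absorbance readings are hypothetical, and the choice of which vector-control wells serve as the reference is assumed.

```python
from statistics import mean, stdev


def normalize_to_control(readings, control="vector"):
    """Scale every A590 reading by the mean of the control group and
    return {group: (mean, s.d.)} of the scaled replicate values."""
    control_mean = mean(readings[control])
    summary = {}
    for group, values in readings.items():
        scaled = [v / control_mean for v in values]
        summary[group] = (mean(scaled), stdev(scaled))
    return summary


# Hypothetical triplicate absorbance readings (not data from the paper).
readings = {
    "vector": [0.80, 0.84, 0.76],
    "Rev7 sh1": [0.40, 0.44, 0.36],
}

for group, (avg, sd) in normalize_to_control(readings).items():
    print(f"{group}: {avg:.2f} ± {sd:.2f}")
```

By construction the control group has a normalized mean of 1, so the other groups read directly as fractions of the control signal.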


formation after 10 Gy ionizing radiation. As shown in Fig. 3a, b and
Extended Data Fig. 4e, f, Rev7 loss resulted in the restoration of RAD51
foci formed after DNA damage. To exclude potential off-target effects
of the hairpins, we reconstituted Rev7 shRNA1 and shRNA2 cells with
shRNA-resistant mouse or human REV7-GFP fusion proteins
(Extended Data Fig. 4g). REV7 re-expression abolished RAD51 focus
formation after DNA damage in GFP-positive cells (Fig. 3b). We confirmed
the re-appearance of RAD51 foci after tumour irradiation in vivo using
computed tomography (CT)-guided high precision cone beam
irradiation of animals carrying PARPi-resistant KB1P(M) tumours with low
Rev7 gene expression (Fig. 3c).

We then tested whether the processing of broken DNA ends requires
ATM in Rev7-depleted cells, and found that inhibition of ATM using
KU55933 efficiently suppresses DNA-damage-induced RAD51 foci and
increases olaparib sensitivity (Extended Data Fig. 4h, i). Hence, the
partial restoration of RAD51 focus formation in Brca1-deficient
mammary tumour cells after DNA damage by inhibition of Rev7 is
ATM-dependent.

In contrast to the results with BRCA1-deficient cells, Rev7 depletion 
in BRCA2-deficient cells did not result in PARPi resistance (Extended 
Data Fig. 5a-f). Furthermore, we did not observe increased PARPi 
Figure 2 | Dissection of REV7 function and its dependent factors.
a, b, Long-term clonogenic assay (a) and quantification (b) using KB1P-G3
cells transduced with the indicated constructs (wt Rev7 stands for
pMSCV-GFP-wt Rev7) and treatments. All groups were normalized to the
absorbance of the shRev7-GFP control. The data represent the mean ± s.d.
c, GFP-REV7 recruitment to sites of DNA damage (visualized by
53BP1-mCherry) was observed 5 min after 405 nm laser exposure (0.99 mW,
60% laser power, 50 s) in KB1P-B11 cells. pEGFP denotes a mammalian
expression vector containing enhanced GFP. Scale bar, 5 μm. d, REV7 foci
formation in H2ax-/- (also known as H2afx-/-), Atm-/-, Mdc1-/- and
Rnf8-/- mouse embryonic fibroblast (MEF) cells and their corresponding
controls before and 4 h after 10 Gy ionizing radiation (IR). DAPI,
4',6-diamidino-2-phenylindole. Scale bar, 10 μm. e, Quantification of REV7
foci formation (>8 foci per cell) in Atm-/- and Atm+/+ MEF cells. The
quantification of foci-positive cells was performed by counting a total of
100 cells per sample. Data are presented as mean ± s.d. from three different
experiments. P value calculated using the t-test.
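
The scoring rule in e (a cell counts as foci-positive when it has more than 8 foci, 100 cells are counted per sample, and results are reported as mean ± s.d. over three experiments) can be sketched as below. This is an illustrative sketch, not the authors' image-analysis code: the function name and every foci count are hypothetical.

```python
from statistics import mean, stdev


def percent_foci_positive(foci_counts, min_foci=8):
    """Percentage of cells with more than `min_foci` foci."""
    positive = sum(1 for n in foci_counts if n > min_foci)
    return 100.0 * positive / len(foci_counts)


# Hypothetical per-cell foci counts for three experiments, 100 cells each
# (not data from the paper).
experiments = [
    [12] * 60 + [3] * 40,
    [10] * 55 + [8] * 45,   # cells with exactly 8 foci are not positive
    [9] * 65 + [0] * 35,
]

percentages = [percent_foci_positive(cells) for cells in experiments]
print(percentages)
print(f"mean = {mean(percentages):.1f}%, s.d. = {stdev(percentages):.1f}%")
```

The strict inequality mirrors the ">8 foci per cell" wording of the legend; a different threshold can be passed through `min_foci`.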


resistance after Rev7 inhibition in the BRCA1/2-proficient p53-/- tumour
cell line KP3.33 (Extended Data Fig. 5g-i). This indicates that REV7
works upstream of BRCA2 and is antagonized by BRCA1. We therefore
tested whether DNA end resection is altered in the absence of Rev7
in BRCA1-deficient cells. Accumulation of the single-strand binding
protein, RPA, was used as a marker for the generation of
single-stranded DNA (ssDNA). Cells were exposed to α particles, and
BRCA1 deficiency resulted in a marked decrease in RPA-positive α tracks
compared to BRCA1-proficient cells (Fig. 3d, e). REV7 depletion in the
Brca1-/- p53-/- cells largely suppressed this defect in both KB1P-G3
and KB1P-B11 cells (Fig. 3e and Extended Data Fig. 6a). In addition to
coating resected DNA ends, RPA also interacts with ssDNA gaps (for
example, during replication). To exclude that the observed RPA
accumulation reflected interaction with internal ssDNA gaps, we prevented
DNA end resection (without influencing replication) by knocking
down CTIP (ref. 14). This eliminated the increase in RPA-positive
tracks induced by Rev7 knockdown in Brca1-/- p53-/- cell lines
(Fig. 3f and Extended Data Fig. 6b), without affecting cell cycle
distribution (Extended Data Fig. 6c). We therefore conclude that increased
resection and not binding to ssDNA gaps is responsible for RPA
accumulation.
Figure 3 | The effect of REV7 inhibition on RAD51 and RPA focus
formation of Brca1-/- p53-/- cells. a, RAD51 focus (red) formation in
KB1P-G3 cells before and 5 h after 10 Gy ionizing radiation. Scale bar, 10 μm.
b, Quantification of RAD51 foci in KB1P-G3 cells (with or without REV7
depletion) transfected with an empty vector (GFP) or vectors containing mouse
or human Rev7 or REV7, respectively. At least 150 GFP-positive cells were
analysed per group in three independent experiments each. The data represent
the mean ± s.d. IR denotes 5 h after 10 Gy ionizing radiation. c, In situ analysis
of RAD51 foci in PARPi-resistant KB1P(M) tumours with low Rev7 gene
expression. KP (KP3.33) denotes mouse mammary tumour cell line (p53-/-);
IR denotes 2 h after 15 Gy ionizing radiation; NIR denotes no ionizing
radiation. ****P < 0.0001, Mann-Whitney U test. d, e, Representative images
of 53BP1-labelled α tracks in cells positive or negative for RPA (d) and
quantification of RPA-positive tracks 2 h after ionizing radiation (e). KP
(p53-/-) or KB1P-G3 cells with or without Rev7-targeting shRNAs were tested.
Scale bar, 5 μm. f, Quantification of RPA- and 53BP1-positive α tracks in
KB1P-G3 cells transfected with non-targeting control (ctrl) short interfering
RNAs (siRNAs) or siRNAs against mouse Ctip. CTIP protein expression
of the indicated groups was checked by western blotting, with β-actin
(ACTB) as a loading control. g, Quantification of HR using the DR-GFP
reporter assay. GFP-positive cells normalized to the vector-transduced
Brca1-/- p53 shRNA cells are shown. The data represent mean ± s.d.
**P < 0.01, two-tailed t-test.


As Rev7 loss could restore end resection in BRCA1-deficient cells,
we analysed whether its depletion could restore full HR proficiency in
this context. Using mouse embryonic stem (mES) cells with a Brca1
selectable conditional knockout allele, we observed that Rev7 loss
indeed prevented cell death of mES cells after BRCA1 deletion, and restored
RAD51 focus formation upon DNA damage (Extended Data Fig. 6d-h).
Moreover, we reproducibly observed a partial restoration of HR function
in the DR-GFP reporter assay for homologous recombination when
Rev7 was depleted in BRCA1-deficient mES cells (Fig. 3g).

Our data for REV7 are reminiscent of previous findings that 53BP1
loss occurs in subsets of human breast carcinomas and can also restore
HR to BRCA1-deficient cells. As with 53BP1, we found a frequent
aberrant reduction or loss of the REV7 protein in human triple-negative
breast carcinomas (Extended Data Fig. 7). Addressing the relationship
between REV7, 53BP1 and the 53BP1 non-homologous end-joining
(NHEJ) effector protein, RIF1 (ref. 18), we found that Rev7 deficiency
did not compromise the formation of endogenous 53BP1 or RIF1 foci
(Extended Data Fig. 8a-d). By contrast, endogenous REV7 foci or
laser-induced stripes were absent in 53BP1-depleted mouse and human cells
(Fig. 4a and Extended Data Fig. 4c, d), strongly suggesting that REV7
acts downstream of 53BP1. This is also consistent with our results that
PARPi resistance is not increased when both Rev7 and 53bp1 are
depleted (data not shown). Despite such strong evidence for a cooperative
role for REV7 and 53BP1, we did not detect REV7 in 53BP1
immunocomplexes isolated from untreated cell lysates, or ATM-phosphorylated
53BP1 immunocomplexes containing RIF1 that were induced by DNA
damage (Extended Data Fig. 8e and data not shown). Although the
intricacies of the interactions remain to be determined, REV7
recruitment to DNA damage sites by 53BP1 may result from indirect
interactions or an activity elicited by 53BP1 protein complexes in chromatin
at DSB sites.
Figure 4 | REV7 is a downstream effector of 53BP1 on inhibiting end
resection and promoting CSR. a, REV7 foci formation in 53bp1-/- and
53bp1+/+ MEF cells before and 4 h after 10 Gy ionizing radiation. Scale bar,
10 μm. b, Quantification of CSR to IgA of shRNA-transduced CH12 cells 40 h
after stimulation (CIT denotes CD40 antibody, IL-4 and TGF-β1). Data
represent mean ± s.d. from two independent experiments performed in
triplicate. c, Schematic of IgH locus shows relative positions of quantitative
PCR amplicons used in ChIP experiments. A control non-IgH locus (Rpp30)
was also examined. Indicated CH12 cell lines stimulated for 30 h with CIT
were subjected to ChIP with IgG (control), histone H2AX and RPA32
monoclonal antisera. After background subtraction, values were normalized to
the DNA input signals, followed by the maximum value in each data set.
Mean signals, two replicates ± s.e.m.
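
The ChIP normalization in c (background subtraction, then normalization to the DNA input signal, then scaling by the maximum value in the data set) can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' analysis: the function name and all numbers are hypothetical, and taking the IgG control signal at the same locus as the background is an assumption.

```python
def normalize_chip(signal, background, input_signal):
    """Subtract background, divide by input, then scale so that the
    largest value in the data set equals 1.

    All three arguments are dicts keyed by locus name.
    """
    per_input = {
        locus: (signal[locus] - background[locus]) / input_signal[locus]
        for locus in signal
    }
    top = max(per_input.values())
    return {locus: value / top for locus, value in per_input.items()}


# Hypothetical qPCR signals for one cell line (not data from the paper).
rpa_chip = {"Smu": 5.0, "Salpha": 3.0, "Rpp30": 1.0}
igg_chip = {"Smu": 1.0, "Salpha": 1.0, "Rpp30": 1.0}   # assumed background
dna_input = {"Smu": 2.0, "Salpha": 2.0, "Rpp30": 2.0}

print(normalize_chip(rpa_chip, igg_chip, dna_input))
```

Scaling by the maximum puts every data set on a 0 to 1 range, which makes loci comparable across antibodies with different absolute signal.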
To examine whether REV7, like 53BP1 (ref. 18), also promotes NHEJ
of DSBs during class switch recombination (CSR), we depleted Rev7
transcripts in the mouse CH12 B-cell line, which after stimulation
undergoes CSR from IgM to IgA at a high rate. Efficient Rev7
knockdown was achieved using several shRNAs, reducing CSR to levels
comparable with 53BP1-depleted cells when compared to control-depleted
cells (Fig. 4b and Extended Data Fig. 9a, b). Moreover, these defects
were not accompanied by defects in cell proliferation, Aid (also known
as Aicda), or germ-line transcript (μGLT/αGLT) expression (Extended
Data Fig. 9c-e). Conditional REV3-ablation has been reported to reduce
CSR efficiency in B lymphocytes, suggesting that distinct from 53BP1,
REV7 might participate in CSR through Polζ function. This considered,
we reasoned that we might separate the function of REV7 from that of
Polζ during CSR, at the level of DSB resection inhibition. To this end,
RPA enrichment was measured by chromatin immunoprecipitation
(ChIP) at control and several IgH loci in control, 53BP1-, REV3- or
REV7-depleted CH12 lines stimulated to undergo CSR. Consistent with
the role of 53BP1 in resection inhibition, 53BP1-depletion was
accompanied by a 3-5-fold enrichment of RPA ChIP signal specifically in
donor (Sμ) and acceptor (Sα) IgH switch (S) regions where DSBs occur
during IgM to IgA CSR, but not at an IgH Sγ1 locus or a control
non-IgH Rpp30 locus (Fig. 4c). Notably, these defects were closely mimicked
after REV7 depletion, whereas shRNA-mediated REV3 depletion
yielded no detectable increase in RPA IgH S-region enrichment,
despite diminishing CSR efficiency (Extended Data Fig. 9f) as expected.
Importantly, at each locus equivalent ChIP signals for total histone were
obtained between cell lines. Thus, our data support a Polζ-independent
function of REV7 in inhibiting the nucleolytic processing of DSBs
generated during CSR.

Our data uncover a crucial role for REV7 in regulating DSB repair.
REV7 depletion restores homology-directed DNA repair of
BRCA1-deficient cells resulting in PARPi resistance. We attribute this result to
the inhibitory effect of REV7 on DNA end resection. Like 53BP1
together with RIF1 and PTIP (ref. 18), REV7 may function as a NHEJ
factor that performs a regulatory role in DSB repair pathway choice. This
provides new insight into the versatile functions of REV7 in addition to
its role in translesion synthesis, and helps to explain how HR can be
partially restored in the absence of BRCA1.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cell culture and reagents. KB1P-B11 (B11) and KB1P-G3 (G3) cell lines were 
derived from a Breal’~ p53 /~ mouse mammary tumour as described’. KB2P- 
1.21 and KB2P-3.4 cell lines originate from a Brcal ~/~ p53-’~ mouse mammary 
tumour, and KP3.33 cell line from a p53 ’~ mouse mammary”. These cell lines 
were cultured in DMEM/F-12 (Life Technologies) supplemented with 10% FCS, 
50 U ml−1 penicillin, 50 ng ml−1 streptomycin, 5 μg ml−1 insulin (Sigma), 5 ng ml−1
epidermal growth factor (Life Technologies) and 5 ng ml−1 cholera toxin (Gentaur)
under low oxygen conditions (3% O2, 5% CO2, 37 °C). SUM149PT cells were grown
in RPMI1640 (Life Technologies) supplemented with 10% FCS, under normal oxygen
conditions (21% O2, 5% CO2, 37 °C). U2OS, phoenix and 293T cells were cultured
in DMEM (Life Technologies) supplemented with 10% FCS, under normal oxygen
conditions (21% O2, 5% CO2, 37 °C). mES cells with a selectable conditional BRCA1
deletion (R26CreERT2;Brca1SCo/Δ)5 were cultured on gelatin-coated plates in
60% buffalo rat liver cell-conditioned medium supplied with 10% FCS, 0.1 mM
β-mercaptoethanol (Merck) and 1 × 10° U ml−1 ESGRO LIF (Millipore) under normal
oxygen conditions (21% O2, 5% CO2, 37 °C). CH12 cells (CH12F3-2) were
cultured in RPMI-1640 medium (Sigma) supplemented with 10% FBS, 5% NCTC 
109 medium (Sigma), 50 μM β-mercaptoethanol, 50 U ml−1 penicillin and 50 ng ml−1
streptomycin under normal oxygen conditions. Olaparib and AZD2461 were provided
by AstraZeneca; KU-55933 (KuDOS) was bought from Selleckchem (S1092).
Lentivirus-based transduction of cells with shRNA. Glycerol stocks of shRNA 
hairpins were obtained from the Sigma Mission library (TRC 1.0) and isolation of 
plasmids was carried out with the high pure plasmid Mini Kit or Genopure maxi 
kit (Roche). 293T cells were seeded 16h before transfection. For each 10-cm dish, 
0.5 ml 2X HBS (8.18 g NaCl, 0.2 g Na2HPO4·7H2O, 5.95 g HEPES in 500 ml MilliQ
water at pH 7.01) was added into a sterile falcon tube. In another sterile falcon tube,
6 μg plasmid DNA of interest, 2 μg pHCMV-G envelope vector (pMD.G), 2 μg
pRSV-Rev, 2 μg packaging vector pMDLg/pRRE, 250 μl 0.5 M CaCl2 and distilled
water were added to bring up to 0.5 ml. The CaCl2/plasmid DNA mix was added to
the 2X HBS and incubated for 20 min and then added to the cells. Medium was
refreshed after 6 h and another 18 h, respectively. The supernatant of 293T cells containing
lentivirus was collected after 24 h to infect cells with polybrene (6 μg ml−1)
for 12h. The medium was refreshed after lentivirus infection and the cells were 
selected with puromycin. 
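
As a cross-check on the 2X HBS recipe above, the listed masses can be converted to molar concentrations. The following is a minimal sketch: the molecular weights are standard values supplied here as assumptions (they are not stated in the text), and the function and variable names are illustrative.

```python
# Convert the 2X HBS recipe (grams per 500 ml) into millimolar concentrations.
# Molecular weights (g/mol) are standard values and are NOT given in the text.
MOLECULAR_WEIGHTS = {
    "NaCl": 58.44,
    "Na2HPO4-7H2O": 268.07,
    "HEPES": 238.30,
}

# Masses and volume as listed in the recipe.
RECIPE_GRAMS = {"NaCl": 8.18, "Na2HPO4-7H2O": 0.2, "HEPES": 5.95}
VOLUME_LITRES = 0.5


def millimolar(grams, mol_weight, litres):
    """Return the concentration in mM for a solute mass dissolved in a volume."""
    return grams / mol_weight / litres * 1000.0


hbs_2x_mM = {
    name: millimolar(grams, MOLECULAR_WEIGHTS[name], VOLUME_LITRES)
    for name, grams in RECIPE_GRAMS.items()
}

for name, conc in hbs_2x_mM.items():
    print(f"{name}: {conc:.1f} mM")
```

Under these assumptions the recipe works out to roughly 280 mM NaCl, 1.5 mM phosphate and 50 mM HEPES in the 2X stock.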

Individual shRNA vectors used were collected from the TRC library.
Mouse Rev7: sh1: TRCN0000012844_CCAGTGGAGAAGTTTGTCTTT; sh2: TRCN0000012846_CATCTTCCAGAAGCGCAAGAA; sh3: TRCN0000012847_GATACAGGTCATCAAGGACTT;
human REV7: sh1: TRCN0000006569_CCCTGATTCCAAGTGCTCTTA; sh2: TRCN0000006570_CCCGGAGCTGAATCAGTATAT; sh3: TRCN0000006571_CCCAGTGGAGAAATTCGTCTT; sh4: TRCN0000006573_CATCTTCCAGAAACGCAAGAA;
mouse 53bp1: (puromycin) sh: TRCN0000081778_GCTATTGTGGAGATTGTGTTT; (neomycin) sh: same sequences as above;
human 53BP1: sh1: TRCN0000018866_CCAGTGTGATTAGTATTGATT; sh2: TRCN0000018865_GATACTTGGTCTTACTGGTTT;
mouse p53: (neomycin) TRCN0000054551_AGAGTATTTCACCCTCAAGAT;
mouse Rev1: sh1: TRCN0000120298_GCCGAGATCAACTATGGAATA; sh2: TRCN0000120297_CAGCAGTGCTTGTGAGGTATT;
mouse Rev3: sh1: TRCN0000119969_CCGTCACATTAGTGAGACTAT; sh2: TRCN0000119970_GCCCACATACACTTTCTTCTT.

Loss-of-function screen. In total, 1,976 lentiviral hairpins (pLKO.1) from the Sigma 
Mission library (TRC Mm 1.0) that target 391 mouse genes involved in the DNA 
damage response were selected (see Supplementary Table 1). This library was used 
to generate pools of lentiviral shRNA in 293T cells to infect target cells. After in- 
fection, the cells stably expressing integrated shRNA were selected with puromy- 
cin. Cells with HR restoration were selected with a high concentration of olaparib 
(500 nM, about 100-fold the IC50), which killed cells of the empty vector group.
Surviving cells were pooled and genomic DNA was extracted using the Gentra 
Puregene kit according to the manufacturer’s protocol (Qiagen). shRNA inserts 
were retrieved from 50 ng genomic DNA by PCR amplification (PCR1 and PCR2) 
using the following conditions: (1) 95 °C, 5 min; (2) 95 °C, 30 s; (3) 60 °C, 30 s; (4) 
72 °C, 1 min; (5) go to step (2), 20 cycles; (6) 72 °C, 5 min; (7) 4 °C. The PCR reaction
mix was as follows: 0.6 μl DMSO, 4.0 μl Phusion HF buffer (5×), 0.4 μl
dNTPs, 1.0 μl primer f (10 mM), 1.0 μl primer r (10 mM), 11.8 μl mQ, 0.2 μl Phusion,
1.0 μl gDNA for PCR1 or 1 μl PCR1 products for PCR2. Adaptors and indexes
for deep sequencing (Illumina HiSeq 2000) were incorporated into PCR primers as
follows.
PCR1 forward:
PCR1_01_PLKO1_f_Integration determination_1, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGTGATCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_02_PLKO1_f_Integration determination_2, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTAGCCCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_03_PLKO1_f_untreated_1, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTCACTGTCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_04_PLKO1_f_untreated_2, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTATTGGCCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_05_PLKO1_f_olaparib_1, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTGATCTGCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_06_PLKO1_f_olaparib_2, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCAAGTCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_07_PLKO1_f_olaparib_3, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTCTGATCCTTGTGGAAAGGACGAAACACCGG-3';
PCR1_08_PLKO1_f_olaparib_4, 5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCTAAGCTACTTGTGGAAAGGACGAAACACCGG-3'.
PCR1 reverse: P7_pLKO1_r, 5'-CAAGCAGAAGACGGCATACGAGATTTCTTTCCCCTGCACTGTACCC-3'.
PCR2 forward: P5_IlluSeq, 5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT-3'.
PCR2 reverse: P7, 5'-CAAGCAGAAGACGGCATACGAGAT-3'.
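
The per-reaction volumes listed for PCR1 and PCR2 can be tallied to confirm the total reaction volume and scaled into a master mix. This is a minimal sketch, assuming all volumes are in microlitres (the unit symbols are partly garbled in the source); the 10% pipetting excess and the choice to keep the template and the indexed forward primer out of the master mix are illustrative assumptions, not part of the protocol.

```python
# Per-reaction volumes from the text, assumed to be in microlitres.
REACTION_UL = {
    "DMSO": 0.6,
    "Phusion HF buffer (5x)": 4.0,
    "dNTPs": 0.4,
    "forward primer": 1.0,
    "reverse primer": 1.0,
    "water": 11.8,
    "Phusion polymerase": 0.2,
    "template (gDNA or PCR1 product)": 1.0,
}

# Components added per tube: template and the sample-specific indexed primer.
PER_SAMPLE = ("template (gDNA or PCR1 product)", "forward primer")


def total_volume(reaction):
    """Sum all component volumes of one reaction."""
    return round(sum(reaction.values()), 2)


def master_mix(reaction, n_reactions, excess=0.10, per_sample=PER_SAMPLE):
    """Scale the shared components for n reactions plus a pipetting excess."""
    scale = n_reactions * (1.0 + excess)
    return {
        name: round(volume * scale, 2)
        for name, volume in reaction.items()
        if name not in per_sample
    }


total = total_volume(REACTION_UL)
# Final buffer strength: a 5x stock diluted into the total volume.
buffer_final_x = 5.0 * REACTION_UL["Phusion HF buffer (5x)"] / total
mix = master_mix(REACTION_UL, n_reactions=8)

print(f"total volume per reaction: {total} ul")
print(f"final buffer concentration: {buffer_final_x}x")
print(mix)
```

The components sum to 20 μl per reaction, which brings the 5× buffer to 1×; eight reactions correspond to the eight indexed PCR1 forward primers listed above.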

PCR2 products were purified using the PCR purification kit from Qiagen. The
shRNA stem sequence was segregated and aligned to the TRC library. The reads of
different hairpins were counted and the following criteria were used to select the 
top hits for further validation: (1) hairpins targeting the same gene in survival clones 
should have at least 1 X 10* reads (total ~6 X 10° reads); (2) at least two different 
hairpins targeting the same gene should be present; (3) hairpins in resistant clones 
should be highly enriched (>8-fold) in cells after olaparib selection; and (4) hair- 
pins should be present in 4 out of 4 independent screens. 
PARPi treatment study. Long-term clonogenic assay: on day 0, 1.5 X 10* (B11) 
or 1 X 10* (G3, KB2P_1.21 or KB2P_3.4) or 6 X 10° (KP3.33) cells were seeded per 
well with PARPi (or untreated control) into six-well plates. The medium of the 
PARPi treatment groups was refreshed with PARPi every 4 days. On day 5, the un- 
treated control group was stopped and the PARPi treatment groups were stopped 
after another 2-3 weeks and stained with 0.1% crystal violet. Using the SUM149PT 
cells, 4 X 10* cells were seeded per well with olaparib (or untreated control) into 
12-well plates. The medium of the PARPi treatment groups was refreshed with 
olaparib every 4 days. On day 6, the untreated control group was stopped and the 
olaparib treatment groups were stopped on day 8 and stained with 0.1% crystal vio- 
let. Quantification of the clonogenic assay was done by determining the absorbance 
of crystal violet at 590 nm. 

Short-term clonogenic assay: on day 0, 4 X 10° (G3) cells were seeded per well 
with olaparib (or untreated control) into 6-well plates. On day 4, the medium was 
refreshed with olaparib or untreated control. On day 8, all the groups were stained 
with Leishman dye and quantification was done by the relative colony numbers. 
ATM and PARP inhibitor combination study. On day 0, 1 X 10* (G3) cells were 
seeded per well into 6-well plates and then ATM inhibitor or olaparib or their com- 
bination was added. The medium was refreshed every 3 days with the different drugs. 
For the combination therapy groups, ATM inhibitor was applied for 6 days. On 
day 5, ATM inhibitor alone and untreated control groups were stopped and the 
other groups were stopped on day 12 and stained with 0.1% crystal violet. 
Constructs. Human REV7 was amplified by PCR using Platinum Taq polymerase 
(Invitrogen) from U2OS cDNA using the following primers: 5'-ATAGAATTCAATGACCACGCTCACACGACAAGAC-3'
and 5'-ATATGGTACCATCAGCTGCCTTTATGAGCGCGC-3'. Mouse Rev7 was amplified from mouse lung cDNA
in two parts to introduce silent mutations, making it resistant to Rev7 shRNA2
(mRev7R). The following primers were used for part A:
[IB11m] 5'-ATATGAATTCGATGACCACCCTCACGCGC-3' and
[IB14m] 5'-TACTTCTTCCGTTTCTGAAAGATGCCCACCGGGTA-3';
and for part B:
[IB12m] 5'-ATATGGTACC-AT-GCTGTTCTTATGCGCTCGCT-3' and
[IB13m] 5'-GGGCATCTTTCAGAAACGGAAGAAGTACAACGTGC-3'.
Equal parts of both PCR reactions were mixed and
used for a PCR using IB11m and IB12m to create the complete mouse Rev7 sequence 
including the silent mutations. 
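
The silent-mutation design can be checked by comparing the mouse Rev7 sh2 target site with the matching stretch of primer IB13m and translating both. This is a minimal sketch: the reading-frame offset is an assumption inferred from the ATG of the mREV7R gBlock sequence given in this section, and the variable names are illustrative.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons from the start of dna; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


SH2_TARGET = "CATCTTCCAGAAGCGCAAGAA"   # mouse Rev7 sh2 target from the hairpin list
RESISTANT = "CATCTTTCAGAAACGGAAGAA"    # matching stretch of primer IB13m
FRAME_OFFSET = 1                       # assumption: codons start at the second base

# Positions (0-based) where the resistant sequence differs from the target.
mismatches = [i for i, (a, b) in enumerate(zip(SH2_TARGET, RESISTANT)) if a != b]
wt_peptide = translate(SH2_TARGET[FRAME_OFFSET:])
mut_peptide = translate(RESISTANT[FRAME_OFFSET:])

print("mismatch positions:", mismatches)
print("target peptide:   ", wt_peptide)
print("resistant peptide:", mut_peptide)
```

Under the stated frame assumption, the three nucleotide changes fall in codon third positions, so the encoded peptide is unchanged while the shRNA target site is disrupted.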

The PCR product and pEGFP-C1 vector were digested using EcoRI and KpnI 
and ligated using the T4 DNA Ligase (Roche) to generate pEGFP-hREV7 and 
pEGFP-mREV7R. 

Using pEGFP-mREV7R as a template, pEGFP-C1-based REV7 truncated constructs
were amplified by PCR using the following primers:
forward: 5'-ATATGAATTCGATGACCACCCTCACGCGC-3';
reverse: mREV7R (1-55aa): 5'-ATATGGTACCGGACATCTGAACCGGCAC-3';
mREV7R (1-81aa): 5'-ATATGGTACCCTCCACATCGTTCTTCTCCAGG-3';
mREV7R (1-110aa): 5'-ATATGGTACCGATGGACAGCAAGGGAGGC-3';
mREV7R (1-140aa): 5'-ATATGGTACCGTTGTGATCCAGGACAGC-3';
mREV7R (1-183aa): 5'-ATATGGTACCGTCGTGCATGTGGACATCCTG-3'.
pEGFP-REV7 mutants (shRNA-resistant)
were ordered as gBlocks (IDT) and cloned into the pEGFP-C1 vector using EcoRI 
and KpnI restriction enzymes and quick ligase (NEB). The DNA sequences that 
were ordered as gBlocks are: mREV7_shRNAresistant (mREV7R): CGCCGCG 
AATTCCGCCACCATGACCACCCTCACGCGCCAAGACCTCAACTTTGGC 
CAAGTGGTGGCTGACGTGCTCTCCGAGTTCCTGGAGGTGGCCGTGCAC 
CTGATTCTCTATGTGCGCGAGGTCTACCCGGTGGGCATCTTTCAGAAA 
CGGAAGAAGTACAACGTGCCGGTTCAGATGTCCTGTCACCCGGAGCTG 
AACCAGTACATCCAGGACACACTCCACTGCGTCAAACCTCTCCTGGAG
AAGAACGATGTGGAGAAGGTGGTGGTGGTGATTTTGGATAAGGAACAC 
CGCCCAGTGGAGAAGTTTGTCTTTGAGATCACTCAGCCTCCCTTGCTG 
TCCATCAATTCAGACTCCCTCCTGTCTCATGTGGAGCAGCTGCTTCGAG 
CCTTCATCCTTAAGATTAGTGTGTGTGATGCTGTCCTGGATCACAACC 
CTCCAGGCTGCACATTTACAGTCCTCGTGCACACAAGAGAAGCTGCTA 
CTCGAAACATGGAGAAGATACAGGTCATCAAGGACTTCCCATGGATC 
CTGGCAGATGAACAGGATGTCCACATGCACGACCCCCGCTTGATACCC 
CTAAAAACCATGACGTCGGACATTTTAAAGATGCAGCTCTACGTTGAA 
GAGCGAGCGCATAAGAACAGCTGAGGTACCCCGGG; mREV7_shRNAresistant_L186A/Q200A/Y202A:
CGCCGCGAATTCCGCCACCATGACCACC
CTCACGCGCCAAGACCTCAACTTTGGCCAAGTGGTGGCTGACGTGCTC 
TCCGAGTTCCTGGAGGTGGCCGTGCACCTGATTCTCTATGTGCGCGAG 
GTCTACCCGGTGGGCATCTTTCAGAAACGGAAGAAGTACAACGTGCC 
GGTTCAGATGTCCTGTCACCCGGAGCTGAACCAGTACATCCAGGACAC 
ACTCCACTGCGTCAAACCTCTCCTGGAGAAGAACGATGTGGAGAAGG 
TGGTGGTGGTGATTTTGGATAAGGAACACCGCCCAGTGGAGAAGTTT 
GTCTTTGAGATCACTCAGCCTCCCTTGCTGTCCATCAATTCAGACTCCC 
TCCTGTCTCATGTGGAGCAGCTGCTTCGAGCCTTCATCCTTAAGATTAG 
TGTGTGTGATGCTGTCCTGGATCACAACCCTCCAGGCTGCACATTTAC 
AGTCCTCGTGCACACAAGAGAAGCTGCTACTCGAAACATGGAGAAGA 
TACAGGTCATCAAGGACTTCCCATGGATCCTGGCAGATGAACAGGATG 
TCCACATGCACGACCCCCGCGCTATACCCCTAAAAACCATGACGTCGG 
ACATTTTAAAGATGGCTCTCGCTGTTGAAGAGCGAGCGCATAAGAAC 
AGCTGAGGTACCCCGGG; mREV7_shRNAresistant_C70R: CGCCGCGAA
TTCCGCCACCATGACCACCCTCACGCGCCAAGACCTCAACTTTGGCCA 
AGTGGTGGCTGACGTGCTCTCCGAGTTCCTGGAGGTGGCCGTGCACCT 
GATTCTCTATGTGCGCGAGGTCTACCCGGTGGGCATCTTTCAGAAACG 
GAAGAAGTACAACGTGCCGGTTCAGATGTCCTGTCACCCGGAGCTGAA 
CCAGTACATCCAGGACACACTCCACCGCGTCAAACCTCTCCTGGAGAA 
GAACGATGTGGAGAAGGTGGTGGTGGTGATTTTGGATAAGGAACACC 
GCCCAGTGGAGAAGTTTGTCTTTGAGATCACTCAGCCTCCCTTGCTGT 
CCATCAATTCAGACTCCCTCCTGTCTCATGTGGAGCAGCTGCTTCGAG 
CCTTCATCCTTAAGATTAGTGTGTGTGATGCTGTCCTGGATCACAACC 
CTCCAGGCTGCACATTTACAGTCCTCGTGCACACAAGAGAAGCTGCTA 
CTCGAAACATGGAGAAGATACAGGTCATCAAGGACTTCCCATGGATC 
CTGGCAGATGAACAGGATGTCCACATGCACGACCCCCGCTTGATACCC 
CTAAAAACCATGACGTCGGACATTTTAAAGATGCAGCTCTACGTTGAA 
GAGCGAGCGCATAAGAACAGCTGAGGTACCCCGGG. Using pEGFP- 
mREV7R, C70R and L186A/Q200A/Y202A mutants as templates, Gateway-compatible
pMSCV-GFP or pLenti6-UBC (Invitrogen)-based REV7 truncated or REV7
mutated constructs were amplified by PCR using the following primers:
forward: 5'-GGGGACAACTTTGTACAAAAAAGTTGGCATGACCACCCTCACGCGCCAA-3';
reverse: mREV7R (full-length for wt REV7, C70R REV7, L186A/Q200A/Y202A REV7): 5'-GGGGACAACTTTGTACAAGAAAGTTGGGTATTCAGCTGTTCTTATGCGCTCGCTC-3';
mREV7R (1-140aa): 5'-GGGGACAACTTTGTACAAGAAAGTTGGGTATTCAGTTGTGATCCAGGACAGC-3';
mREV7R (1-183aa): 5'-GGGGACAACTTTGTACAAGAAAGTTGGGTATTCAGTCGTGCATGTGGACATCCTG-3'.
PCR products were purified using PCR purification
kit (Roche) and then subjected to BP (BP-clonase II, Invitrogen) and LR (LR- 
clonase II, Invitrogen) reaction according to manufacturer’s instructions. Wild- 
type 53BP1 and 53BP17™*° constructs were previously described"”. All the constructs 
were verified by sequencing. 

siRNA, cDNA transfection and cDNA transduction. SMARTpool siRNAs target- 
ing mouse Ctip (siGENOME: M-055713-02, Thermo Scientific) and non-targeting 
control were transfected into cells using Dharmacon 1 transfection reagent (Thermo 
Scientific). After 48 h, cells were subjected to western blot and α track assay. siRNAs
against 53bp1, Rnf8 or Rnf168 were transfected into cells using Lipofectamine
RNAiMax according to the manufacturer’s instruction. The sequences were as follows:
53bp1 siRNA: 5'-GAGAGCAGAUGAUCCUUUA-3' (ref. 24);
Rnf8 siRNA: 5'-GGACAAUUAUGGACAACAATT-3' (ref. 25);
Rnf168 siRNA: 5'-GGCGAAGAGCGAUGGAGGAtt-3' (ref. 26);
GFP siRNA: 5'-GGCUACGUCCAGGAGCGCACCTT-3'.
Immunofluorescence and western blotting analysis was done 64 h after
transfection. 

For pEGFP-based constructs, transient transfection was done using X-treme 
GENE HP DNA transfection reagent (Roche) according to the manufacturer's 
manual. GFP-positive cells were sorted by flow cytometry and subjected to west- 
ern blotting and immunofluorescence staining. 

For pMSCV-GFP (retrovirus)-based constructs, transient transfection was done
using X-treme GENE HP DNA transfection reagent in phoenix cells and the medium
was refreshed after 24 h. Retroviruses were collected 48 h after transfection,
and then target cells were infected for two consecutive periods of 12 h using fresh
virus. The medium was refreshed after retrovirus infection and the cells were
selected with blasticidin. 

For pLenti6-UBC (lentivirus)-based constructs, together with the packaging plasmids
p59, p60 and p61, transient transfection was done using X-treme GENE HP
DNA transfection reagent in 293T cells and the medium was refreshed after 24h. 
Lentiviruses were collected 48 h after transfection and then target cells were infected 
for two consecutive periods of 12 h using fresh virus. The medium was refreshed 
after lentivirus infection and the cells were selected with blasticidin. 
Mice, generation of PARPi-resistant mouse mammary tumours. All mouse 
experiments were approved by the Animal Experiments Review Board of the Nether- 
lands Cancer Institute (Amsterdam), complying with Dutch legislation. Olaparib- 
resistant KB1P(M)- and AZD2461-resistant KB1P mouse mammary tumours were 
generated as described’. In this study we analysed a total of 55 PARPi-resistant and 
52 PARPi-sensitive tumours derived from 13 individual KB1P(M) and 10 indivi- 
dual KB1P donor tumours. 

To generate mouse mammary tumours from cell lines, 5 X 10° cells were ortho- 
topically transplanted into 6-week-old female wild-type FVB/N_Ola129 mice as 
reported previously’. Mice were randomized to the PARPi or untreated control 
groups. 

RT-qPCR. Total RNA was isolated from cells using the high pure RNA isolation
kit (Roche). cDNA was made from 1 μg RNA with the GoScript Reverse Transcription
system (Promega). For the quantitative PCR, cDNA, forward and reverse primers
(400 nM) and Lightcycler 480 SYBR Green I Master (Roche) were applied in a Lightcycler
480 96-well plate (Roche). The SYBR green signals were measured with Lightcycler
480 II (Roche). The Ct value of the gene of interest was subtracted from that of the
housekeeping gene, and 2 was raised to the power of this difference; the same was done for the s.d.
The primer sequences used in this study are as follows:
mouse Hprt forward: 5'-CTGGTGAAAAGGACCTCTCG-3';
mouse Hprt reverse: 5'-TGAAGTACTCATTATAGTCAAGGGCA-3';
mouse Rev7 forward: 5'-ACACTCCACTGCGTCAAACC-3';
mouse Rev7 reverse: 5'-AAAGACAAACTTCTCCACTGGGC-3';
mouse Rev1 forward: 5'-ACAGGATTGCTTGGTGCCTGTG-3';
mouse Rev1 reverse: 5'-TGAAGTCCGCGTTGCTCTTCTC-3';
mouse Rev3 forward: 5'-AAGAGATGTCACAGACAGGCCC-3';
mouse Rev3 reverse: 5'-AGTTAGACAGCCGCTGTTGTGC-3';
mouse αGLT forward: 5'-GACATGATCACAGGCACAGG-3';
mouse αGLT reverse: 5'-TTCCCCAGGTCACATTCATCGT-3';
mouse μGLT forward: 5'-TAGTAAGCGAGGCTCTAAAAAGCAT-3';
mouse μGLT reverse: 5'-AGAACAGTCCAGTGTAGGCAGTAGA-3';
mouse Aid forward: 5'-GAAAGTCACGCTGGAGACCG-3';
mouse Aid reverse: 5'-TCTCATGCCGTCCCTTGG-3';
human HPRT-P1 (primer pair 1) forward: 5'-GCAGACTTTGCTTTCCTTGG-3';
human HPRT-P1 reverse: 5'-ACACTTCGTGGGGTCCTTTT-3';
human HPRT-P2 forward: 5'-TGCTCGAGATGTGATGAAGG-3';
human HPRT-P2 reverse: 5'-AATCCAGCAGGTCAGCAAAG-3';
human REV7-P1 forward: 5'-TGCTGTCCATCAGCTCAGAC-3';
human REV7-P1 reverse: 5'-TCTTCTCCATGTTGCGAGTG-3';
human REV7-P2 forward: 5'-GCTCACACGACAAGACCTCA-3';
human REV7-P2 reverse: 5'-GACCGGCACGTTGTACTTCT-3';
mouse 53bp1 forward: 5'-TCAGCCAAACAGGACAAGCA-3';
mouse 53bp1 reverse: 5'-GCAGAATCTTCAGCAGCAAGG-3'.
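
The relative-expression calculation described in this section (subtract the Ct of the gene of interest from that of the housekeeping gene, then raise 2 to that power) can be sketched as follows. The function name and the example Ct values are hypothetical and serve only to illustrate the arithmetic.

```python
def relative_expression(ct_gene, ct_housekeeping):
    """Return 2 ** (Ct_housekeeping - Ct_gene): expression relative to the reference gene."""
    return 2.0 ** (ct_housekeeping - ct_gene)


# Hypothetical Ct values for a control and a knockdown sample.
control = relative_expression(ct_gene=24.0, ct_housekeeping=20.0)
knockdown = relative_expression(ct_gene=26.0, ct_housekeeping=20.0)

# Fraction of transcript remaining in the knockdown relative to the control.
remaining = knockdown / control

print(control, knockdown, remaining)
```

With these example values, a Ct that is two cycles higher in the knockdown corresponds to a quarter of the control transcript level.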

Western blotting. Cells were washed with ice-cold PBS and lysed on ice for 30 min 
with RIPA lysis buffer supplemented with three protease inhibitors (P8340, P5726, 
P0044; Sigma-Aldrich). Protein concentration was determined by the Bradford 
Protein Assay Kit (Bio-Rad) and a calibration standard curve created from the BSA. 
The samples were prepared for loading by adding 4X sample buffer (Invitrogen) 
and heating the samples at 70 °C for 10 min. Total proteins were separated by SDS-
PAGE on 3-8% Tris-acetate (for 53BP1, RIF1) or 4-12% Bis-Tris gradient gels (all
others). Next, proteins in the gel were electrophoretically transferred to PVDF membrane
(Millipore) (for REV7, ACTB, α-tubulin) or to NC membranes (Invitrogen)
(all others) and then the membrane was blocked in 5% milk with Tris-buffered 
saline Triton X-100 buffer (100 mM Tris, pH 7.4, 500 mM NaCl, 0.1% Triton X-100) 
(TBS-T0.1%). Membranes were incubated with primary antibodies in 5% milk in 
TBS-T0.1% overnight at 4 °C. Horseradish peroxidase (HRP)-conjugated second- 
ary antibody incubation was performed for 1 h at room temperature in 5% milk in 
TBS-T0.1% and signals were visualized by ECL. Primary antibodies used in this 
study were as follows: mouse anti-REV7 (612266, BD Biosciences), 1:5,000 dilu- 
tion; rabbit anti-53BP1 (NB100-304, Novus), 1:1,000 dilution; rabbit anti-53BP1 
(A300-272A, Bethyl), 1:5,000 dilution; rabbit anti-CTIP (ab70163, Abcam), 1:1,000 
dilution; mouse anti-α-tubulin (T6074, Sigma), 1:5,000 dilution; mouse anti-ACTB
(MAB1501R, Millipore), 1:5,000 dilution; rabbit anti-mouse RIF1 (SK1316) (ref. 19), 
1:2,000 dilution; mouse anti-RNF8 (B-2, Santa Cruz), 1:1,000 dilution; rabbit anti- 
RNF168 (ref. 27), 1:5,000 dilution; mouse anti-GAPDH (1D4, GeneTex) 1:1,000 
dilution. Secondary antibodies used in this study were as follows: polyclonal rabbit 
anti-mouse immunoglobulins/HRP (P0161, Dako), 1:10,000 dilution; polyclonal 
swine anti-rabbit immunoglobulins/HRP (P0217, Dako), 1:10,000 dilution. 
Immunofluorescence. Cells were grown on glass coverslips (12 mm) in 24-well
plates. To induce ionizing radiation-induced foci, cells were γ-irradiated (10 Gy) and
compared to non-irradiated controls 5 h after ionizing radiation. For this purpose
the cells were pre-extracted using cold CSK buffer (10 mM HEPES-KOH, pH 7.9,
100 mM NaCl, 300 mM sucrose, 3 mM MgCl2, 1 mM EGTA and 0.5% (v/v) Triton
X-100) on ice for 5 min and then cold CSS buffer (10 mM Tris, pH 7.4, 10 mM
NaCl, 3 mM MgCl2, 1% (v/v) Tween and 0.5% (w/v) sodium deoxycholate) on ice
for 5 min. Cells were washed with PBS++ (PBS with 1 mM CaCl2 and 0.5 mM
MgCl2) and fixed using 2% PFA/PBS++ for 20 min at room temperature. Fixed
cells were washed three times with PBS++ and stored at 4 °C. The cells were incubated
for 20 min in 0.2% Triton X-100/PBS++ to be permeabilized. Then the cells
were washed three times in staining buffer (PBS++, 1% BSA, 0.15% glycine, 0.1%
Triton X-100), incubated for 30 min in staining buffer at room temperature, incu- 
bated with the first antibody for 2 h at room temperature in staining buffer, washed 
three times in staining buffer, incubated with the second antibody for 1h at room 
temperature in staining buffer and washed three times in staining buffer. Next, the 
cells were counter-stained with DAPI for 5 min, washed in staining buffer, washed 
in PBS++, mounted in Vectashield and sealed with nail polish. Primary antibodies
used in this study were as follows: rabbit anti-RAD51 (70-001, BioAcademia),
1:20,000 dilution; rabbit anti-53BP1 (A300-272A, Bethyl), 1:4,000 dilution. Secondary
antibodies used in this study were as follows: Alexa Fluor 568 F(ab')2 fragment
goat anti-rabbit (A21069, Invitrogen), 1:1,000 dilution; Alexa Fluor 488 anti-mouse 
antibody (A11001, Invitrogen), 1:1,000 dilution. 

For REV7 staining in MEF cells, cells cultured on coverslips were treated with 
10 Gy IR and allowed to recover for 4h. Cells were then washed with PBS, pre- 
extracted with 0.5% Triton X-100 solution for 3 min and fixed with 3% parafor- 
maldehyde for 12 min. Coverslips were washed with PBS and then immunostained 
with REV7 antibody (612266, BD Biosciences, 1:200 dilution) and anti-γH2AX or
anti-53BP1 in 5% goat serum for 1 h at room temperature. Coverslips were washed
and incubated with secondary antibodies conjugated with rhodamine or FITC for
30 min at room temperature. Cells were then stained with DAPI to visualize nuclear 
DNA. The coverslips were mounted onto glass slides with anti-fade solution. For 
RIF1 staining in MEF cells, wild-type MEFs stably transduced with indicated shRNA- 
expressing lentiviruses were examined for RIF1 foci following neocarzinostatin 
treatment. Automated quantification of RIF1 foci following mock and neocarzi- 
nostatin treatment was performed using Cell-Profiler software (Broad Institute). 
Laser irradiation of human cells and immunofluorescence staining. Cells were 
grown on plastic disks (17 mm diameter) that were cut using CNC cutter from the 
bottom of standard 10-cm cultivation dish (TPP) ultraviolet-sterilized and placed 
inside the wells of a 12-well plate. BrdU (0.5 μM) was added into siRNA-transfected
cells 40h after the transfection to pre-sensitize cells towards UV-A wavelength. 
Twenty-four hours after BrdU addition, the plastic disks with cells were removed 
and covered by a coverslip and immediately placed inside Zeiss AxioObserver Z.1 
inverted microscope combined with LSM 780 confocal module. Cells were irradiated
at 20 °C via a ×40 water immersion objective (Zeiss C-Apo 40×/1.2W DICIII),
using a 355 nm 65 mW laser set on 100% power. The total laser dose that can be
further manipulated by the amount of irradiation cycles was empirically set to six 
irradiation cycles. Laser track was pre-defined to cover all the cells within the ac- 
quisition area with at least one stripe across the nucleus. After the irradiation pro- 
cess the coverslip was gently removed and plastic disk was quickly placed back into 
the same well of the 12-well plate and incubated for another 45 min at standard 
cultivation conditions. The plastic disks with laser-irradiated cells were first processed
by pre-extraction at 4 °C. This involved washing with PBS (4 °C), equilibration
for 2 min in sucrose buffer 1 (10 mM PIPES, pH 6.8, 100 mM NaCl, 1.5 mM MgCl2
and 300 mM sucrose) on ice and then pre-extraction for 15 min on ice, on a slow-
moving shaker, using sucrose buffer 2 (10 mM PIPES, pH 6.8, 100 mM NaCl, 1.5 mM
MgCl2, 300 mM sucrose, 0.5% Triton X-100, 5 μg ml−1 leupeptin, 2 μg ml−1 aprotinin,
0.1 mM phenylmethanesulfonylfluoride (PMSF), 1 mM dithiothreitol (DTT)).
After the pre-extraction cells were washed by PBS (4 °C) and fixed by 4% para- 
formaldehyde (PFA) for 15 min at room temperature. PFA was washed out three 
times by PBS. The disks were further processed as standard coverslips (that is, 
blocking in blocking solution (DMEM plus 20% FCS) for 1 h followed by incuba- 
tion with primary antibodies involving REV7 (BD Bioscience, mouse, 1:200), pS139- 
H2AX (Cell Signaling, 20E3, rabbit, 1:300) and 53BP1 (Santa Cruz, H-300, rabbit 
1:400) for 2h, and with appropriate secondary antibodies coupled to Alexa Fluor 
488 and Alexa Fluor 568 fluorophores (dilution 1:1,000) (Life Technologies). Both
primary and secondary antibodies were dissolved in the blocking solution. After
washes in PBS, the disks were incubated in 1 μg ml−1 DAPI in dH2O at room
temperature for 5 min and air dried. Dried disks were placed on a standard micro- 
scopy glass (cell layer face up) and anchored by two rubber bands laced over the 
glass. Stained cells were mounted using VectaShield (Vector Labs) mounting me- 
dium and covered by a coverslip. The samples were examined using Zeiss Axio- 
Observer Z.1 inverted microscope combined with LSM 780 confocal module using 
×40 oil objective (Zeiss EC PlnN 40×/1.3 Oil DICII). For re-staining, after the first
acquisition the plastic disk and the microscopy glass were marked with a diamond cutter
(to ensure the same positioning of the disk in the future), the coverslip was gently
removed and the disk was washed three times in PBS, 0.5% Tween to remove the
mounting medium. Next, the disk was incubated in the 1X Re-Blot solution (Re-
Blot Plus Mild, Millipore) for 30 min on a slow-moving shaker. The solution was
washed out three times in PBS. The sample was then ready for a new staining procedure
involving a new set of primary and secondary antibodies following the same protocol
as described above.

In situ analysis of RAD51 foci formation. Five matched PARPi-resistant and 
-sensitive KB1P(M) tumours were orthotopically transplanted into wild-type FVB/ 
N recipient mice. When tumours reached ~500 mm3 in volume, the mice were
randomized to be either irradiated (dose: 15 Gy) using a CT-guided high precision 
cone beam micro-irradiator (X-RAD 225Cx) or left untreated. As a positive con- 
trol a BRCA1-proficient KP tumour was taken along. Two hours after irradiation 
the tumours were taken out and fixed in 4% formalin. Immunofluorescence staining 
was performed on FFPE slides. RAD51 foci were detected using a non-commercial 
antibody provided by R. Kanaar in a dilution of 1:5,000. 53BP1 foci were detected 
using rabbit anti-53BP1 (A300-272A, Bethyl), diluted 1:500. As a secondary antibody,
goat anti-rabbit Alexa Fluor 568 (Invitrogen) was used at a dilution of 1:1,000
(2 μg ml−1). Images were taken by a ‘blinded’ investigator using a confocal microscope
(Leica SP5, Leica Microsystems GmbH), equipped with a ×100 objective. For
each tumour five random areas (246 × 246 μm) were imaged. Image stacks (~four
slices) were analysed in ImageJ, using an in-house developed macro to automat- 
ically and objectively evaluate the RAD51 foci. In brief, nuclei were segmented by 
thresholding the (median-filtered) DAPI signal, followed by a watershed operation 
to separate touching nuclei. For every z-stack the maximum-intensity projection of 
the foci signal was background-subtracted using a difference of gaussians method. 
Next, for each nucleus, foci candidates were identified as locations where the re- 
sulting pixel values exceeded the background by a factor (typically tenfold) times 
the median standard deviation of all nuclei in the image. In combination with addi- 
tional filters discriminating for foci size and absolute brightness this procedure 
yielded a robust and reliable foci count for all nuclei. Results were validated by 
visual inspection. 
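
The thresholding step of the foci analysis can be illustrated with a simplified sketch. It is not the in-house ImageJ macro: it covers only the rule that a pixel is a candidate when its background-subtracted value exceeds a factor times the median of the per-nucleus standard deviations, and it omits segmentation, the difference-of-gaussians step, grouping of adjacent pixels into foci, and the size and brightness filters. The data layout, the function name and the toy values are assumptions.

```python
from statistics import median, pstdev


def count_candidate_pixels(nuclei, factor=10.0):
    """Count above-threshold pixels per nucleus.

    nuclei: dict mapping a nucleus id to a list of background-subtracted
    pixel values. The threshold is factor x the median of the per-nucleus
    standard deviations, shared by all nuclei in the image.
    """
    threshold = factor * median(pstdev(pixels) for pixels in nuclei.values())
    counts = {
        nucleus_id: sum(1 for value in pixels if value > threshold)
        for nucleus_id, pixels in nuclei.items()
    }
    return counts, threshold


# Invented toy image: two nuclei with noise only, one with three bright pixels.
nuclei = {
    "n1": [0, 1] * 50,
    "n2": [0, 2] * 50,
    "n3": [0, 1] * 48 + [200, 200, 200, 0],
}
counts, threshold = count_candidate_pixels(nuclei)
print(threshold, counts)
```

Using the median across nuclei keeps the threshold from being inflated by the few nuclei that contain very bright foci, which is one plausible reason for the choice described above.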

REV7 recruitment to local laser-induced DNA damage sites. pEGFP-REV7 or 
pEGFP and 53BP1-mCherry were co-transfected into Brca1−/− p53−/− cells using
X-treme GENE HP DNA Transfection Reagent according to the manufacturer’s 
manual. GFP and mCherry double-positive cells were sorted by flow cytometry and 
seeded onto coverslips. Cells were sensitized by pre-incubation with Hoechst33342 and 
were subsequently irradiated using a 405-nm diode laser (×63 objective, 0.99 mW,
60% laser power, 50 s) on a Leica SP5 confocal microscope equipped for live-cell
imaging. EGFP-REV7 and 53BP1-mCherry recruitment in living cells was monitored
by time-lapse imaging.

Alpha track assay. Cells were seeded in dishes with a mylar surface as previously 
described”, allowing α-particle irradiation through the bottom of the dish. One or
two hours after irradiation three times for 30 s with a 241americium source, cells were
washed once in ice-cold PBS. Subsequently, cells were extracted with cold CSK buffer
(10 mM HEPES-KOH, pH 7.9, 100 mM NaCl, 300 mM sucrose, 3 mM MgCl2,
1 mM EGTA, 0.5% (v/v) Triton X-100) and cold CSS buffer (10 mM Tris, pH 7.4,
10 mM NaCl, 3 mM MgCl2, 1% (v/v) Tween20, 0.5% (w/v) sodium deoxycholate)
for 5 min each before fixation in 4% PFA in PBS for 20 min at room temperature. 
Fixed cells were washed five times in PBS plus 0.1% Triton X-100 and washed once 
in blocking solution (0.5% BSA plus 0.15% glycine in PBS). Primary antibodies 
were diluted in blocking solution and cells were incubated overnight at 4 °C. After 
incubation, cells were washed five times with PBS plus 0.1% Triton X-100 and 
washed once in blocking solution. Secondary antibodies were diluted in blocking 
solution and cells were incubated at room temperature for at least 1 h. Afterwards, 
cells were washed five times in PBS plus 0.1% Triton X-100 and once in PBS. 
Finally, mylar films were glued on glass slides and cells were mounted using 
Vectashield with DAPI. For quantification, at least 100 53BP1 or MRE11-positive 
tracks were scored for the presence of RPA. Primary antibodies used in this study 
were as follows: rabbit anti-53BP1 (NB100-304, Novus), 1:1,000 dilution; mouse 
anti-RPA2 (Ab2175, Abcam), 1:500 dilution; MRE11 antibody”, 1:200 dilution. 
Secondary antibodies used in this study were as follows: Alexa Fluor 594 goat anti- 
rabbit IgG (A 31631, Invitrogen), Alexa Fluor 488 goat anti-mouse IgG (A11001, 
Invitrogen). 

BrdU propidium iodide cell cycle assay. Cells were seeded in 6-cm dishes and 
attached overnight. The next day, cells were incubated for 15 min with 5 uM BrdU 
in growth medium, trypsinized, washed in PBS and fixed in 70% ethanol overnight. 
Fixed cells were washed in PBS, resuspended in pepsin solution (5 mg pepsin in 
10 ml 0.1 N HCl) and incubated for 20 min at room temperature. Subsequently 
blocking solution (0.5% Tween, 0.1% BSA in PBS) was added to wash. Next, cells 
were resuspended in 2 N HCl and incubated 12 min at 37 °C. To neutralize, 100 mM
borate buffer (pH 8.5) was added and cells were pelleted. Anti-BrdU-FITC antibody
(347583, BD Bioscience) was diluted in blocking solution, added to cells and incubated
for 2 h on ice. Stained cells were washed once with blocking solution and
resuspended in 500 μl PBS plus 12.5 μl RNaseA and 1 μl propidium iodide (P3566,
Invitrogen). Cell cycle analysis was performed the next day. 

Survival assay of mES cells. R26CreERT2;Brca1SCo/Δ mES cells!® were infected
with hairpins targeting Rev7 or the vector control and selected with puromycin.
Expression of mouse Brca1 was switched off by overnight incubation with 0.5 μM
4-hydroxytamoxifen. Four days after switching, 5,000 cells of the indicated groups
were seeded per well into 6-well plates and assayed for growth. Surviving colonies 
were fixed in formalin and stained with crystal violet. 

DR-GFP for HR assay. R26(CreERT2/wt);Brca1(SCo/Δ) mES cells were targeted with a
modified version of the p59X DR-GFP construct as described. To allow experiments
on a p53-deficient background, cells were infected with a lentiviral p53 shRNA
(5′-AGAGTATTTCACCCTCAAGAT-3′) using a pLKO.1 vector provided with a
neomycin resistance marker. A G418-selected p53-deficient clone was subsequently 
infected with hairpins targeting Rev7 or the vector control and selected with puro- 
mycin. Expression of mouse Brca1 was switched off by overnight incubation with 
0.5 µM 4-hydroxytamoxifen to measure the effect of Rev7 loss on HR. HR reporter
assays were performed by Lipofectamine 2000 (Invitrogen) transfection of the
I-SceI-mCherry plasmid, which was generated by providing the cBasI-SceI expression
plasmid with CMV-mCherry (Clontech). Forty-eight hours after transfection,
mCherry/GFP double-positive cells were monitored by flow cytometry as described.

53BP1 pull-down. 53BP1 pull-downs were performed as described. Flag pull-downs
were performed from 2 mg lysate prepared from MEFs following mock or
neocarzinostatin treatment (2 h at 250 ng µl⁻¹). Control, 53BP1 and 53BP1(20AQ)
immunoprecipitates were then treated with sequential low-salt (150 mM) and high-salt
(500 mM) RIPA buffer washes, before re-equilibration in nuclear extract buffer
(20 mM HEPES, pH 7.9, 100 mM KCl, 0.2 mM EDTA, 20% glycerol, 0.5 mM PMSF,
0.5 mM DTT and protease inhibitors (Roche, Complete)). After incubation in HeLa
nuclear extract (2 mg), control, 53BP1 and 53BP1(20AQ) complexes were washed four
times in nuclear extract buffer, then eluted with triple-Flag peptide (Sigma).

Immunoglobulin CSR. CH12 cells were either mock-treated or stimulated with agonist
anti-CD40 antibody (0.5 µg ml⁻¹; eBioscience; HM40-3), mouse IL-4 (5 ng µl⁻¹;
R&D Systems) and TGF-β1 (1.25 ng µl⁻¹; R&D Systems). Cell-surface IgA expression
was determined by flow cytometry, immunostaining with biotinylated anti-mouse
IgA antibody (eBioscience; 13-5994), and Alexa488-streptavidin conjugate
(Life Technologies).

Carboxyfluorescein succinimidyl ester assay. Cell proliferation was assessed in 
stimulated CH12 cells using carboxyfluorescein succinimidyl ester (CFSE) according
to the manufacturer’s instructions (CellTrace; Life Technologies).

Chromatin immunoprecipitation. Each ChIP was performed from chromatin 
prepared from ~1 × 10⁷ CH12 cells stimulated for 30 h with agonist CD40 antibody,
IL-4 and TGF-β1 essentially as described. For each individual ChIP, 4 µg of
RPA34-20 (Ab-3, Calbiochem), 2 µg H2AX (3522-1, Epitomics), or 4 µg control
mouse anti-IgG (sc-2025; Santa Cruz) coupled to 25 µl Protein-G Dynabeads (Life
Technologies, 10003D) was used. Quantities of immunoprecipitated chromatin
were calculated relative to total input chromatin by quantitative PCR in duplicate
on a CFX96 Real-Time Analyzer (Bio-Rad) with the use of iQ SYBR Green (Bio-Rad)
for each primer pair (see below).
Rpp30: forward, 5′-TCCAGTGTGCAAGAAAGCTAAATG-3′, reverse, 5′-GGCAGTGCGTGGAGACTCA-3′;
A (target IgH Sµ): forward, 5′-CAATGTGGTTTAATGAATTTGAAGTTGCCA-3′, reverse, 5′-TCTCACACTCACCTTGGATCTAAGCACTGT-3′;
B (target IgH Sµ): forward, 5′-GCTAAACTGAGGTGATTACTCTGAGGTAAG-3′, reverse, 5′-GTTTAGCTTAGCGGCCCAGCTCATTCCAGT-3′;
C (target IgH Sγ1): forward, 5′-AGTGTGGGAACCCAGTCAAA-3′, reverse, 5′-GTACTCTCACCGGGATCAGC-3′;
D (target IgH Sα): forward, 5′-TGAAAAGACTTTGGATGAAATGTGAACCAA-3′, reverse, 5′-GATACTAGGTTGCATGGCTCCATTCACACA-3′.
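
The ChIP quantification above is stated only as "relative to total input chromatin"; the exact arithmetic is not given. A minimal sketch of one common way to express this, the percent-input calculation, is shown here. The function name, the 1% input aliquot and the assumption of perfect doubling per cycle are illustrative assumptions, not details from the Methods.

```python
from math import log
from statistics import mean


def percent_input(ct_ip, ct_input, input_fraction=0.01, efficiency=2.0):
    """Return immunoprecipitated chromatin as a percentage of total input.

    ct_ip, ct_input: replicate Ct values (for example qPCR duplicates) for the
        ChIP sample and for the saved input aliquot.
    input_fraction: share of total chromatin kept as the input aliquot
        (0.01 = 1%; an assumed value, not taken from the Methods).
    efficiency: amplification factor per cycle (2.0 = perfect doubling;
        also an assumption).
    """
    # Shift the input Ct to the value that 100% of the chromatin would give.
    adjusted_input_ct = mean(ct_input) - log(1.0 / input_fraction, efficiency)
    # Each cycle of difference corresponds to one factor of `efficiency`.
    return 100.0 * efficiency ** (adjusted_input_ct - mean(ct_ip))


if __name__ == "__main__":
    # Hypothetical duplicate Ct values, for illustration only.
    print(percent_input([27.1, 26.9], [24.0, 24.2]))
```

Averaging the duplicate Ct values before exponentiating is one of several reasonable choices; a measured primer efficiency can be passed in place of the default.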

Immunohistochemistry on paraffin sections. A panel of formalin-fixed, paraffin-embedded
archival specimens from a series of 50 human primary breast carcinomas,
surgically resected before any chemotherapy or radiotherapy, was examined. In
parallel, sections from ten specimens of formalin-fixed, paraffin-embedded
histologically normal human breast tissues were used as normal tissue controls for
immunohistochemical analysis of REV7 expression patterns. The breast carcinomas
were all of the triple-negative type, defined in this cohort as having fewer than
1% tumour cells positive in standard immunohistochemical staining for oestrogen
and progesterone receptor proteins, and lacking amplification of the HER2 (also
known as ERBB2) gene. For the immunohistochemical analysis, 4-µm-thick sections
were cut from representative blocks of the tumour tissues, and the sections were
deparaffinized and processed for sensitive immunoperoxidase staining with the
primary mouse monoclonal antibody to REV7 (BD Biosciences, 612266, diluted
1:100). The staining procedure was essentially as described, with antigen unmasking
in citrate buffer, pH 6.0, for 15–20 min in a microwave oven, and overnight
incubation with the primary antibody, followed by the Vectastain Elite kit (Vector
Laboratories) and nickel sulfate enhancement without nuclear counterstaining.
Mouse normal serum and antibody to γH2AX served as negative and positive controls,
respectively. Evaluation of the staining patterns was performed by two independent
observers (with very similar outcomes), including a senior oncopathologist
with more than 20 years of experience with breast cancer pathology. The results
were scored in the following categories, based on comparison of cancer cells with
the series of normal breast tissue controls, and also with the normal cells present
within each of the tumour sections. As normal breast epithelium showed reproducible
positivity for REV7 protein in over 90% of cells, we considered expression in a
tumour to be aberrantly decreased when fewer than 70% of cancer cells were
positive. In addition to the percentage of stained tumour cells, staining intensity
was classified as either normal (comparable with the intensity of normal cells present
on each section) or aberrantly low (clearly below the intensity seen in adjacent
normal cells, and up to undetectable in some cases). Overall, while 6 out of 47
informative cases showed both an aberrant fraction of REV7-stained cells
and reduced intensity of staining, 12 additional cases showed less pronounced
defects limited to either reduced staining intensity or a reduced percentage of
positive cancer cells. As the primary goal of these analyses was to establish a detection assay
for REV7 on archival tissue specimens and to assess the frequency of potentially
REV7-deficient breast tumours, correlation analyses with clinical parameters including
treatment outcome remain to be performed on larger cohorts of patients in the
future.
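
The scoring scheme described above combines two readouts per tumour: the fraction of REV7-positive cancer cells (aberrant when below 70%) and staining intensity (normal or aberrantly low). A minimal sketch of that two-way classification follows; the category labels and function names are illustrative assumptions, and only the 70% cut-off comes from the text.

```python
from collections import Counter

# Cut-off stated in the text: fewer than 70% positive cancer cells is aberrant.
POSITIVE_FRACTION_CUTOFF = 70.0


def classify_rev7(percent_positive, intensity_low):
    """Assign one tumour to a scoring category.

    percent_positive: percentage of cancer cells with nuclear REV7 staining.
    intensity_low: True if intensity is clearly below adjacent normal cells.
    The category names are hypothetical labels chosen for this sketch.
    """
    fraction_low = percent_positive < POSITIVE_FRACTION_CUTOFF
    if fraction_low and intensity_low:
        return "reduced fraction and intensity"
    if fraction_low:
        return "reduced fraction only"
    if intensity_low:
        return "reduced intensity only"
    return "normal"


def summarize(cases):
    """Count categories over an iterable of (percent_positive, intensity_low)."""
    return Counter(classify_rev7(pct, low) for pct, low in cases)


if __name__ == "__main__":
    # Made-up example cases, not data from the study.
    print(summarize([(95.0, False), (60.0, True), (65.0, False), (90.0, True)]))
```

Under this scheme the 6 concomitant cases and the 12 cases with a single defect reported above would fall into the first category and the two middle categories, respectively.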

Statistics. Statistical tests used were the log-rank test, t-test and Mann–Whitney U test,
with P < 0.05 as the significance level. No statistical methods were used to
predetermine sample size.
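
For readers unfamiliar with the Mann–Whitney U test named above, the following is a minimal standard-library sketch of how the U statistic is obtained from pooled ranks. It is an illustration only: the function names are assumptions, it does not compute a P value (which would come from exact tables or a statistics package), and it is not a description of the software used in the study.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the full run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two independent samples."""
    ranks = average_ranks(list(x) + list(y))
    rank_sum_x = sum(ranks[: len(x)])
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


if __name__ == "__main__":
    # Made-up measurements, for illustration only.
    print(mann_whitney_u([1.2, 0.8, 1.1], [2.3, 1.9, 2.8]))
```

The smaller of the two U values is the one usually compared against the critical value for the chosen significance level.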
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Extended Data Figure 1 | Loss of Rev7 causes PARPi resistance in vitro and 
in vivo. a, Quantification of Rev7 transcript levels in KB1P-B11 cells
transduced with Rev7-targeting shRNAs or the vector control. Hprt was used as
a control for transcript expression. The data represent mean ± s.d. b, c, Cell
proliferation rates in KB1P-G3 (b) or KB1P-B11 (c) cells analysed using
the MTT assay. d–g, Long-term clonogenic survival assays and quantification
of KB1P-G3 (d, f) or KB1P-B11 (e, g) cells transduced with the indicated
constructs and treatments. All the groups were normalized by the absorbance of
the vector control. The data represent mean ± s.d. h, Quantification of the real
colony numbers from the short-term clonogenic survival assay of KB1P-G3
cells with or without Rev7 loss exposed to olaparib. i, REV7 protein levels were
determined by western blotting of lysates derived from KB1P-G3 cells
transduced with the indicated constructs. j, Overall survival of mice with
KB1P-B11-derived Rev7-depleted or control tumours treated with one regimen
of 50 mg kg⁻¹ olaparib daily for 28 days or left untreated. The P value
was calculated using the log-rank test. k, l, Relative tumour growth of individual
KB1P-G3- (k) and KB1P-B11- (l) derived Rev7-depleted or control
tumours treated with one regimen of 50 mg kg⁻¹ olaparib daily for 28 days
or left untreated.

Extended Data Figure 2 | Loss of REV7 causes olaparib resistance in
BRCA1-deficient SUM149PT cells. a, Western blotting analysis of REV7 or
53BP1 expression in SUM149PT cells transduced with REV7- or 53BP1-targeting
hairpins or the vector control. b, Example of a long-term clonogenic
survival assay using the indicated hairpins and olaparib concentrations.
c, Quantification of the clonogenic assays using absorbance of crystal violet at
590 nm. The data represent mean ± s.d. All the groups were normalized by the
absorbance of the vector control and showed significant differences to the
control (P < 0.01, t-test).
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Extended Data Figure 3 | Rev1 or Rev3 inhibition and PARPi sensitivity of 
Brca1−/−;p53−/− mammary tumour cells. a, b, Quantification of Rev1 (a) or
Rev3 (b) transcript levels in KB1P-G3 cells transduced with Rev1- or Rev3-targeting
shRNAs or the vector control. Hprt was used as a control for
transcript expression. The data represent the mean ± s.d. c, Long-term
clonogenic survival assays of KB1P-G3 cells exposed to the indicated PARP
inhibitors. d, Quantification of the clonogenic assay by determining the
absorbance of crystal violet at 590 nm. All the groups were normalized by the
absorbance of the vector control. The data represent the mean ± s.d.
e, f, Quantification of Rev7 transcript (e) or protein (f) levels in KB1P-G3 cells
transduced with Rev7-targeting shRNAs or the vector control. Hprt was used as
a control for transcript expression; α-tubulin as a control for protein
expression. The data represent mean ± s.d. g, Recruitment of GFP-tagged REV7
mutants to sites of DNA damage was observed 5 min after 405 nm laser
exposure (0.99 mW, 60% laser power, 50 s) in KB1P-B11 cells. Scale bar, 10 µm.
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Extended Data Figure 4 | REV7 recruitment to the DNA damage sites in 
human cells. a–c, Human REV7 recruitment to sites of laser-induced DNA
damage was analysed in U2OS cells transfected with siRNAs targeting RNF8
(siRNF8), RNF168 (siRNF168) (a), 53BP1 (si53BP1) (c) and GFP (siGFP).
RNF8 and RNF168 protein levels were determined by western blotting
(b) using lysates derived from U2OS cells transfected with the indicated
siRNAs. d, For the quantification of the REV7 signal within laser-induced DNA
damage stripes, a minimum of 100 striped (that is, γH2AX-positive) cells
were analysed for the presence of the REV7 signal in two independent
experiments. Scale bars, 50 µm. e, RAD51 focus (red) formation in KB1P-B11
cells before and 5 h after 10 Gy ionizing radiation. Scale bar, 10 µm.
f, Quantification of RAD51 foci in KB1P-B11 cells in the presence or absence of
REV7 depletion. At least 150 cells were analysed per group in three
independent experiments each. Error bars indicate s.d.; IR denotes 5 h after
10 Gy ionizing radiation. g, Western blotting analysis of REV7-depleted
KB1P-G3 cells transfected with human REV7-GFP or Rev7-shRNA-resistant
mouse REV7-GFP fusion proteins. h, Same as in e and f using the ATM
inhibitor KU55933 with or without IR (5 h after 10 Gy ionizing radiation).
i, Long-term clonogenic survival assay of KB1P-G3 cells exposed to olaparib in
the presence or absence of KU55933 pre-treatment.

Extended Data Figure 5 | Loss of Rev7 does not cause PARPi resistance
in Brca2−/−;p53−/− or p53−/− mammary tumour cells in vitro.
a, b, Quantification of Rev7 transcript levels in Brca2−/−;p53−/− (KB2P-1.21 or
KB2P-3.4) cells transduced with Rev7-targeting shRNAs or the vector control.
Hprt was used as a control for transcript expression. The data represent the
mean ± s.d. c–f, Long-term clonogenic survival assays and quantification of
KB2P-1.21 or KB2P-3.4 cells with or without Rev7 depletion exposed to the
indicated treatments. All the groups were normalized by the absorbance of the
vector control. The data represent mean ± s.d. g, Quantification of Rev7
transcript levels in p53−/− (KP3.33) cells transduced with the indicated
constructs. Hprt was used as a control for transcript expression and the data
represent the mean ± s.d. h, i, Long-term clonogenic survival assays and
quantification of KP3.33 cells exposed to the indicated treatments. The data
represent the mean ± s.d.

BRCA1-deficient cells after ionizing radiation. a, Quantification of RPA-positive
α tracks in KB1P-B11 cells 1 or 2 h after irradiation with α particles.
b, Quantification of RPA- and 53BP1-positive α tracks in KB1P-B11 cells
transfected with non-targeting control siRNAs or siRNAs against Ctip.
c, Cell cycle analysis (BrdU incorporation and propidium iodide labelling)
of KB1P-B11 cells transduced with the indicated constructs and siRNAs.
d, e, Quantification of Rev7 transcript (d) or protein (e) levels in
BRCA1-deficient mES cells transduced with Rev7-targeting shRNAs or
the vector control. Hprt was used as a control for transcript expression,
α-tubulin as a control for protein expression. The data represent mean ± s.d.
f, Representative images of surviving colonies of Brca1−/− mES cells
transduced with an empty vector control or Rev7-targeting shRNAs.
g, Quantification of colony formation normalized to the vector control.
h, Quantification of RAD51 foci in Brca1−/− mES cells that were
transduced with the indicated constructs.

Extended Data Figure 7 | REV7 loss frequently occurs in triple-negative
breast cancer. a, b, Quantification of human REV7 transcript levels (a) and
protein levels (b) in U2OS cells transduced with indicated constructs. Two
different pairs of primers for REV7 or HPRT were used for the quantification of
REV7 transcript levels. c–e, Examples of aberrantly reduced REV7 protein
expression in triple-negative human breast carcinomas. Immunohistochemical
detection of REV7 in human breast carcinomas shows moderate to high
nuclear expression in normal human breast tissue (data not shown), and
most invasive breast tumours (c). Aberrant reduction of REV7 with less than
70% of cancer cells that show nuclear positivity (d, e) was observed in 18 out
of 50 cases.
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Extended Data Figure 8 | REV7 is a downstream effector of 53BP1. 

a, Quantification of 53BP1 foci in KB1P-G3 cells in the presence or absence
of REV7 depletion. At least 100 cells were analysed per group in three
independent experiments each. Error bars indicate s.d.; IR denotes 5 h after
10 Gy ionizing radiation. b, REV7 or 53BP1 protein levels were determined
by western blotting of lysates derived from MEF cells transduced with the
indicated control (CNTL) or Rev7- and 53bp1-targeting shRNA constructs.
c, d, RIF1 foci formation (c) after neocarzinostatin (NCS) treatment and
quantification of RIF1 foci (d) in MEF cells in the presence or absence of REV7
or 53BP1 depletion. e, Flag pull-downs were performed from 2 mg lysate
prepared from 53bp1−/−, 53bp1−/− plus 53BP1 and 53bp1−/− plus 53BP1(20AQ)
MEFs after mock or neocarzinostatin treatment. Control, 53BP1 and
53BP1(20AQ) immunoprecipitates were incubated in HeLa nuclear extract
(HNE, 2 mg) and then eluted with triple-Flag peptide. HA, haemagglutinin.


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 9 | The effect of REV7 inhibition on CSR after
antigenic stimulation of CH12 cells. a, Rev7 messenger RNA levels
determined by qRT-PCR were normalized against β-actin (Actb) transcripts in
the indicated shRNA-transduced CH12 cell lines. The data represent the
mean ± s.e.m. from two primer sets specific for Rev7 transcript. CNTL, control.
b, 53BP1 protein of each group normalized to vector-transduced cells (CH12)
was analysed by western blotting. c, IgH µ and α germ-line transcripts (GLT)
and Aid mRNA were estimated by semi-quantitative RT-PCR using twofold
serial dilutions of cDNA made from indicated CH12 cell lines 40 h after
stimulation. Hprt was used as a control for transcript expression.
d, Representative flow cytometric profiles of shRNA-transduced CH12 B cells
stained with anti-IgA antibody 40 h after stimulation with the indicated
cytokines. e, Cells (CH12) were labelled with CFSE immediately before
cytokine stimulation as in Fig. 4d, and cell proliferation was assessed by
flow cytometry at indicated time points. f, Quantification of CSR to IgA of
shRNA-transduced CH12 cells 40 h after stimulation with CD40 antibody, IL-4
and TGF-β1 (CIT). Data represent the mean ± s.d. from two independent
experiments performed in triplicate.


doi:10.1038/nature14247 


Atomic structure of anthrax protective antigen pore 
elucidates toxin translocation 
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Anthrax toxin, comprising protective antigen, lethal factor, and 
oedema factor, is the major virulence factor of Bacillus anthracis, 
an agent that causes high mortality in humans and animals. Pro- 
tective antigen forms oligomeric prepores that undergo conversion 
to membrane-spanning pores by endosomal acidification, and these 
pores translocate the enzymes lethal factor and oedema factor into 
the cytosol of target cells'. Protective antigen is not only a vaccine 
component and therapeutic target for anthrax infections but also an 
excellent model system for understanding the mechanism of protein 
translocation. On the basis of biochemical and electrophysiological 
results, researchers have proposed that a phi (Φ)-clamp composed
of phenylalanine (Phe)427 residues of protective antigen catalyses 
protein translocation via a charge-state-dependent Brownian ratchet’. 
Although atomic structures of protective antigen prepores are 
available’’"*, how protective antigen senses low pH, converts to 
active pore, and translocates lethal factor and oedema factor are 
not well defined without an atomic model of its pore. Here, by cryo- 
electron microscopy with direct electron counting, we determine the 
protective antigen pore structure at 2.9-Å resolution. The structure
reveals the long-sought-after catalytic Φ-clamp and the membrane-spanning
translocation channel, and supports the Brownian ratchet
model for protein translocation. Comparisons of four structures 
reveal conformational changes in prepore to pore conversion that 
support a multi-step mechanism by which low pH is sensed and the 
membrane-spanning channel is formed. 

Triggering conversion from protective antigen (PA) prepore to pore 
by in vitro acidification leads to rapid and irreversible aggregation. 
Attempts to prevent aggregation by screening detergents have largely 
failed’. By low-pH treatment of PA prepores directly on electron micros- 
copy (EM) grids containing a thin layer of continuous carbon film, we 
obtained dispersed particles of PA pore without aggregation (Extended 
Data Fig. 1). We then acquired drift-corrected cryo-electron microscopy 
(cryoEM) images (Extended Data Fig. 1b-d) and reconstructed a map 
at an overall resolution of 2.9 Å using 60,455 particles (Fig. 1, Extended
Data Fig. 2 and Supplementary Video 1). The resolution for most regions
of the cryoEM map is ~2.8 Å (Extended Data Fig. 2c). Our map reveals
rich high-resolution structural features, including amino-acid side chains
and 14 chelated Ca²⁺ ions (Extended Data Fig. 1e–h), and has allowed
unambiguous de novo atomic modelling (Extended Data Table 1) and 
detailed structure and function analyses. 

The overall structure of the PA pore has a ‘flower-on-a-stem’ archi- 
tecture, including corolla, calyx, and stem from top to bottom (Figs 1b 
and 2a and Supplementary Video 1). Each PA protomer is divided into 
four domains in the PA prepore, named 1′, 2, 3, and 4. In the PA pore,
domains 1′, 3, and 4 form the corolla and domain 2 forms the calyx and
the stem; therefore we designate the parts of domain 2 corresponding
to the calyx and the stem as 2c (residues 259–274 and 354–487) and 2s
(residues 275–353), respectively (Fig. 2b, c). Domains 1′ and 2c form a
compact structure responsible for substrate protein binding and intake
(Fig. 2). Domain 2s is an extended β-hairpin (2β2 and 2β3), seven
copies of which assemble to form a membrane-spanning 14-stranded
β-barrel 105 Å in length and 27 Å (from Cα to Cα) in diameter (Fig. 2).
Domain 3 is located peripherally and has close contact with domains 1′
and 2c (Fig. 2b). The cryoEM density of domain 4 is weak and has the
lowest resolution among all domains (inset of Fig. 1b and Extended Data
Fig. 2c), probably because of its flexibility from minimal contact with the
other domains. Rigid-body fitting of domain 4 of the PA prepore crystal
structure to the cryoEM map shows domain 4 shifts ~4 Å towards the
central axis in the PA pore (Extended Data Fig. 3).

The translocation channel of the PA pore has a funnel shape and can 
be divided into four parts based on diameters calculated with MOLE:
mouth, Φ-clamp, throat, and tube (Fig. 3a, b). Its surface is negatively
charged and mainly hydrophilic, but hydrophobic patches are seen at
the α-clamp of the mouth, near the Φ-clamp, and at the middle of the
tube (Fig. 3a, c). As proposed on the basis of the PA prepore structure,
the negatively charged surface promotes passage of cations, and the 
hydrophilic surface facilitates passage of substrate proteins/polypeptides. 

The mouth has a 30-Å opening and inner diameters varying down
to 20 Å (Fig. 3a); it can accommodate protein secondary structure elements,
but not folded domains such as the amino (N)-terminal domain of lethal
factor (LFN). The Φ-clamp below the mouth becomes the bottleneck of
the entire channel, with a solvent-excluded inner diameter of only 6 Å
(Fig. 3a, b), which is smaller than protein secondary structure elements
and therefore may only allow passage of fully unfolded polypeptides.
Underneath the Φ-clamp are the throat, which is an enlarged (~18 Å)
bulb-shaped chamber, and the tube formed by the 14-stranded β-barrel
with inner diameters in the range 12–18 Å and rich in Ser and Thr
residues in its middle region (Fig. 3a, b). The large diameter (>12 Å) of


Figure 1 | CryoEM reconstruction of the PA pore. a, b, Surface 
representations of the cryoEM map of the PA pore at 2.9-Å resolution as viewed
from the top (a) and the side (b). Individual protomers of PA heptamer are 
colour-coded. Inset of b shows the unsharpened cryoEM map in which the 
flexible domains 4 (arrowheads) are visible. 
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Figure 2 | Atomic model of the PA pore. a, Top and side views of the atomic 
model of the PA pore shown as ribbons. Protomers are colour-coded except 
for domain 4 (grey). b, Structural comparison of the protomers of the PA pore 
and prepore (PDB accession number 1TZO). The domains are coloured 
differently according to c. c, Domain organization of the PA protomer. 


the throat and tube can accommodate an α-helix, which may be formed
by polypeptides after passing the Φ-clamp (Fig. 3a). The diameter and
the hydrophilic property of this part of the channel are similar to those of the
exit tunnel of the ribosome, which is proposed to translocate α-helices.

The mouth and the tube are the only two openings of the channel, 
and are accessible to the endosomal and cytosolic compartments, respec- 
tively (Fig. 3a). The rest of the channel is ‘water-tight’ and without holes 
permeable to small molecules. The substrate protein blocks the small 
hole of the Φ-clamp before or during translocation. Thus the Φ-clamp
may act as a gate separating the endosomal and cytosolic compartments.
The differences of proton concentration (ΔpH) and electrical potential
(Δψ) across the Φ-clamp (Fig. 3a) may serve as the primary driving
force for substrate protein translocation.

In contrast to the hydrophilic inner surface of the β-barrel, its outer
surface is largely hydrophobic (Fig. 3c, d). The Phe residues (313, 314,
and 324) form two aromatic belts on opposite sides of the lipid bilayer
(Fig. 3d), which may stabilize membrane insertion. The cryoEM map
shows that the transmembrane region is surrounded by a cloud of disordered
densities, which we interpret as bound detergent molecules
that were added during sample preparation (Fig. 3e). Surprisingly, an
additional hydrophobic surface in the middle of the β-barrel containing
a cluster of hydrophobic residues (Ile289, Val332, Ile334, Leu338
and Leu340 from each protomer) was also bound with detergent molecules
(Fig. 3d–f). However, the function of this additional hydrophobic
region is unknown. 

The overall architecture of the PA pore is similar to those of bacterial
toxins α-haemolysin and Vibrio cholerae cytolysin in their membrane-inserted
conformations (Extended Data Fig. 4) despite differences in
functions. The β-barrel of the PA pore is twice as long as those of
the other two toxins and may facilitate LF and oedema factor translocation
by entropically stabilizing an α-helix within the confinement of
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Figure 3 | Translocation channel of the PA pore. a, Electrostatic surface (left) 
and channel radius profile (right) of the PA pore. An α-helix (green surface
model of residues 555–574 of lethal factor; PDB accession number 1J7N) is
modelled in the β-barrel. The green arrow depicts the direction of protein
translocation. Colour scale, kcal (mol e)⁻¹. b, The translocation channel (dots)
running through the Φ-clamp. Two protomers of the PA pore are shown as
ribbons. Residues Glu398, Asp425, Asp426, Phe427, and Ser429 (ball-and-stick)
are exposed to the channel. c, d, Hydropathy surface of the PA pore
(brown: hydrophobic; purple: hydrophilic; white: neutral). In c, the front half
of the structure is removed to show the luminal surface. In d, two
1,2-dihexadecanoyl-sn-glycero-3-phosphocholine (DPPC) lipid molecules are
modelled near the membrane insertion region. e, Bottom and side views of the
segmented cryoEM map showing the β-barrel (yellow) bound with disordered
detergent molecules (grey). f, The 14-stranded β-barrel (ribbons) and the
hydrophobic residues (spheres) on its outer surface. The hydrophobic residues
are depicted on different protomers for the ease of presentation. 


the cylindrical channel. This longer barrel may also be necessary
to accommodate its receptor situated between domain 4 and the host
membrane (Extended Data Fig. 4). Except for this length difference,
the 14-stranded β-barrels of the PA pore, α-haemolysin, and V. cholerae
cytolysin share geometries, such as diameter, twist of β-strands, and
pitch length (Extended Data Fig. 4).

In the PA pore, the seven 2β10–2β11 loops of the heptamer converge
to form an iris with a 6-Å hole bounded by a symmetrical arrangement
of the seven Phe427 residues (Fig. 4). By contrast, these loops in the PA
prepore do not engage each other, but circumscribe a 30-Å central
hole with adjacent Phe427 residues spaced ~14 Å apart (Extended Data
Fig. 5a). In the PA pore, each 2β10–2β11 loop is stabilized by close interactions
with the 2β7–2β8 loop of the same protomer and the 2β10–2β11
loops of its two neighbouring protomers. The hydrogen bonds between
Asn399 and Ser428 and between Asn399 and Lys397′ of an adjacent
protomer form a chain of interactions, which give rise to a ring of loops
at the iris inside the PA pore (Extended Data Fig. 5b). Consistently,
mutation of Ser428 abolished protein translocation, and mutation of
Lys397 or Asn399 resulted in a dominant negative effect. In the PA
homologues found in Clostridium species, Lys397 and Asp426 are both
replaced by uncharged Gln, implying that this chain of interactions
might be formed differently in these homologues. 

The aryl plane of Phe427 from each protomer is parallel to the cen- 
tral axis of the PA pore. Neighbouring Phe427 residues interact with each 
other by aromatic CH–π interaction in a tilted T-shaped configuration
and possibly by hydrophobic interaction, thus forming the Φ-clamp
(Fig. 4 and Extended Data Fig. 5). The integrity of the Φ-clamp is required
for catalysis of protein translocation, as mutation of even a single Phe427

Figure 4 | Structure of the Φ-clamp. a, Top view of the Φ-clamp region of the
PA pore showing the cryoEM map (mesh) superimposed with the atomic
model (stick). b, Tilted view of the Φ-clamp with seven protomers coloured
differently, showing the aromatic CH–π interaction. c, Cross-section side view


residue of the Φ-clamp severely reduced translocation efficiency and
disrupted the seal against cation passage. Because the PA pore translocates
polypeptides with various side chains, the Φ-clamp may act
like an elastic ‘O-ring’, changing its size and shape to allow an unhindered
passage of different amino-acid residues while maintaining a
good seal during translocation. In the recently reported structure of
bacterial amyloid secretion channel CsgG, eight Phe residues form a
Φ-clamp, which differs from that in the PA pore in having a 9.5-Å hole
and facilitating protein translocation in an ungated manner.
Evidence from electrophysiological studies indicates that the proton
gradient across the endosomal membrane is the primary driving force
for unidirectional translocation of proteins through the PA pore. Proton-driven
transporters usually involve protonation-dependent conformational
changes of two or more alternating gates. In the PA pore,
however, there is only one gate, namely the Φ-clamp, and neither the
Φ-clamp nor its nearby residues are likely to undergo a protonation-dependent
conformational change. Indeed, the PA pore structure supports
the charge-state-dependent Brownian ratchet model proposed
earlier. In this model, a negative electrostatic barrier within the pore
hinders the passage, by Brownian motion, of deprotonated acidic
residues. The fact that acidic residues in the acidic environment
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Figure 5 | Conversion from prepore to pore. a, Superimposition of domains 
2c of the PA pore and prepore (monomer; PDB accession number 3TEW). 
Loops undergoing dramatic conformational changes during the prepore to 
pore conversion are labelled (blue). b, Overview of the three layers of 
conformational changes during the conversion. The interaction between the 
2Bo-2B10 loop and the B-sheet (blue box) is shown in e. c, Superimposition of 
the PA pore (pink, orange, and blue; coloured by domain) and prepore (light 
green; PDB accession number 1TZO). Domains 4 and 2s are removed for 




of the translocation channel near the Φ-clamp region. The Φ-clamp (Phe427)
and the conserved acidic residue Asp425 are coloured in orange and red,
respectively.


of the endosome have a higher probability of being protonated, and are
thus free to pass the barrier, than those in the neutral environment of
the cytosol, leads necessarily to unidirectional movement of polypeptides
across the barrier. Consistent with this model, the PA pore structure
shows three acidic residues, Asp425, Asp426, and Glu398, to be
near the Φ-clamp and proximal to the pore axis (Fig. 3b), generating a
strong negative electrostatic barrier demarcating the endosomal and
cytosolic compartments. In addition, the highly conserved acidic residue
Asp425, located directly underneath the Φ-clamp, is ideally positioned
as a proton sink that may strip protons from protonated
acidic residues passing the Φ-clamp (Fig. 4c). Besides the charge barrier
in the Φ-clamp, efficient protein translocation may require additional
charged spots, such as the top region of the β-barrel.
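
The ratchet argument depends on how much more often an acidic side chain is protonated in the endosome than in the cytosol. As a rough illustration only, the Henderson-Hasselbalch relation gives the protonated fraction of a titratable group. The pKa values used here (about 3.9 for Asp and 4.3 for Glu, typical of free side chains) and the compartment pH values (5.5 and 7.2) are assumptions for the sketch, not values from this work, and pKa values inside the pore may be shifted considerably.

```python
def protonated_fraction(pka: float, ph: float) -> float:
    """Fraction of an acidic group in its protonated (neutral) form,
    from the Henderson-Hasselbalch relation."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


# Assumed textbook-style values; none of these numbers come from the paper.
PKA = {"Asp": 3.9, "Glu": 4.3}
PH_ENDOSOME = 5.5
PH_CYTOSOL = 7.2

for residue, pka in PKA.items():
    f_endo = protonated_fraction(pka, PH_ENDOSOME)
    f_cyto = protonated_fraction(pka, PH_CYTOSOL)
    # The ratio says how much more likely the residue is to be neutral,
    # and hence able to cross a negative barrier, on the endosomal side.
    print(f"{residue}: endosome {f_endo:.4f}, cytosol {f_cyto:.6f}, "
          f"ratio {f_endo / f_cyto:.1f}")
```

Under these assumed values the protonated fraction is small on both sides, but it is several tens of times larger at endosomal pH, which is the asymmetry the ratchet model relies on.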

A key question about the low-pH-triggered conversion from prepore
to pore is how PA senses pH. A notable conformational change during
the conversion is that the 2β10–2β11 loop is flipped from one side to the
other (Fig. 5a and Extended Data Fig. 6). Interestingly, this loop has
different conformations in the two crystal structures of PA octameric
prepores ([PA63]8, PDB accession number 3HVD; [PA63]8[LFN]4, PDB
accession number 3KWV) whereas the remaining parts of the structures
are largely unchanged; that is, this loop is in the pore and


clarity. Between the two structures, domain 2c has a Cα root-mean-square
deviation of 6.87 Å, whereas domains 1′ and 3 remain unchanged with Cα
root-mean-square deviations of 0.54 Å and 0.58 Å, respectively. d, Illustration of
the steps of conformational changes during the prepore to pore conversion.
For simplicity, only two protomers are illustrated. e, Interface between the
2β9–2β10 loop (green) and the bent β-sheet (gold) of the
adjacent protomer in the PA pore. Hydrogen bonds between the backbone of
the loop and the side chains of the β-sheet are depicted with dashed lines.

prepore states in 3HVD (Extended Data Fig. 6b) and 3KWV, respectively.
This comparison suggests that the 2β10–2β11 loop can switch
between two conformations without affecting the overall structure.
Mutation of the conserved Asn422 or Asp425 in this loop abolished the
prepore to pore conversion at low pH in a dominant negative manner.
Therefore we interpret this loop as a pH sensor. Surprisingly, this loop
does not possess any His, a residue whose pKa of approximately 6 falls
in the pH range of 5–7 where the prepore to pore conversion occurs.
The conformational change of this loop might result from destabilization
and rearrangement of hydrogen bonds upon exposure to low pH,
or alternatively from perturbed pKa values of Asp residues (425 and 426) in
this loop. We note that other parts of PA might affect pH sensing.
Prior work has shown that mutation of residues near or in the 2β2–2β3 loop
or at the domain 2/domain 4 interface, or deletion of the 2β2–2β3 loop,
changed the pH threshold for conversion but did not abolish the low-pH
sensitivity. These observations suggest that the involved residues
might regulate pH sensing.

We find that the prepore to pore conversion involves three layers of
conformational changes, all located within domain 2 (Fig. 5b). The first
layer is the above-mentioned pH-sensing 2β10–2β11 loop. The low-pH-triggered
conformational change of this loop is relayed to the following
layers sequentially as detailed below.

The second layer involves the 2β7–2β8, 2β5–2β6, and 2β12–2β13 loops,
which are positioned in an anti-parallel manner (Fig. 5a). In the PA
pore, the N-terminal half of the 2β7–2β8 loop moves downwards to
form a U-shaped turn tethered to the putative pH-sensing 2β10–2β11
loop via a hydrogen bond between Asn399 and Ser428 (Fig. 5a and
Extended Data Figs 5b and 6e). This new inter-loop interaction may
relay the low-pH-triggered conformational changes from the first layer
to the second layer, as supported by the dominant negative effect of
Asn399 or Lys397 mutation, which breaks the hydrogen bond network
connecting these two loops (Extended Data Fig. 5b). As a result of
the downward pull from the 2β7–2β8 loop, all three loops of the second
layer become straightened in the PA pore, with parts of the 2β5–2β6 and
2β12–2β13 loops forming strands that extend the adjacent β-strands
(Fig. 5a and Extended Data Fig. 6c, f). This conformational change is
supported by the observation that mutating any of Pro379, Val455, and
Asn458 of these loops led to defects in protein translocation. This
conformational change leads to the convergence of domain 2c of each
protomer and formation of the Φ-clamp by a 15° rotation of domain
2c around the hinge residues Ile261 and Tyr456, with its distal edge
moving 12 Å towards the central axis of the PA pore (Fig. 5c and
Supplementary Video 2). Indeed, tethering domain 2 to domain 4 by
receptor binding or stabilizing the interface between these two domains
by a poly-γ-D-glutamate capsule impeded domain convergence and
therefore decreased the pH threshold required for the prepore to pore
conversion. Conversely, destabilizing the domain 2/domain 4 interface
by point mutations increased the pH threshold.

The third layer, resulting from the second layer of conformational
changes, involves release and refolding of the precursor of domain 2s
(that is, 2β2, 2β3, 2α1, and the membrane insertion loop), and leads to
the formation of the membrane-spanning β-barrel (Figs 2 and 5b and
Extended Data Fig. 7). The rotation of domain 2c opens the pocket
between domain 2c and domain 4 to release the precursors of domain
2s, which then come together to form the barrel (see details in Extended
Data Fig. 7b). The β-barrel formation might be the result, rather than
the cause, of the convergence of domain 2c, as supported by a cryoEM
map obtained from a subset of particles that shows converged domain
2c in the pore state but lacks the β-barrel (Extended Data Fig. 8). We
note that the stability of the domain 2/domain 4 interface was found to
be a rate-limiting barrier to the conversion, and the separation between
domain 2c and domain 4 might also happen at an earlier stage of the
conversion.

Taken together, these results suggest a multi-step mechanistic model
of the low-pH-triggered conversion from prepore to pore (Fig. 5d).
First, the 2β10–2β11 loop, acting as a pH sensor, changes its conformation upon
acidification (layer 1). Second, the 2β7–2β8, 2β5–2β6, and
2β12–2β13 loops consequently become ordered (layer 2), resulting in the convergence
of domain 2c and the formation of the Φ-clamp. Last, as a result of the
separation of domain 2c from domain 4, the precursors of domain 2s
are released and refold into a uniform β-barrel (layer 3), which, as a
final step, inserts into the endosomal membrane, ready for delivering
the toxic enzymes into the cytosol.

In the PA pore, the convergence of domain 2c creates a large inter-protomer
interface to stabilize the pore conformation (Extended Data
Fig. 7a). Key to the formation of a functional PA pore is a newly identified
interface between the 2β9–2β10 loop of one protomer and the
triple-stranded β-sheet of its neighbour (Fig. 5e).
The above-mentioned convergence of domain 2c brings them together
to form a new binding interface primarily mediated by hydrogen bonds
between the backbone of the loop and the side chains of Thr390, Thr393,
and Asp451 of the β-sheet (Fig. 5e). These interactions explain the
dominant negative effect of the Ser382 or Thr393 mutant, the loss of PA
activity of the Cys mutant of Thr390 or Asp451, and the conservation
of Thr390 and Thr393 in PA homologues. By contrast, these residues
are accessible and free of interactions in the PA prepore, and thus might
be potential drug targets for blocking the conversion to functional
PA pores.

In summary, the 2.9-Å structure of the anthrax PA pore suggests the
2β10–2β11 loop as a pH sensor to trigger conformational changes for
prepore to pore conversion, supports a charge-state-dependent Brownian
ratchet model of proton-driven protein translocation, and can inform
efforts both to engineer PA to target cancer cells and to design measures
to block anthrax toxin entering cells. A more detailed mechanism
of protein translocation through the PA pore awaits additional
experimental evidence, such as structures of the PA pore in the act of
polypeptide translocation.


Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique
to these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. 

Preparation of PA prepores. PA63 prepores were prepared following the described
procedures.

EM sample preparation and data acquisition. For negative-stain EM, 2 µl of
0.1% polylysine solution (Polysciences) was first applied to a glow-discharged grid
covered with carbon film and then removed after 2 min by blotting with filter paper.
The polylysine treatment produced different orientations of particles on EM grids,
thereby overcoming the problem of preferred orientation for the PA pore. Immediately
after removal of polylysine, 2 µl of PA prepore (~50 µg ml−1) was applied
to the grid and incubated for 1 min. The grid was then washed twice with the high-pH
buffer (50 mM HEPES, 50 mM NaCl, pH 8.0), followed by two washes with
the low-pH buffer (50 mM NaOAc, pH 5.0, 0.05% Igepal CA-630) to induce the
conversion of prepore to pore. After removal of excess buffer, the grid was stained
with 0.8% (w/v) uranyl formate.

For cryoEM, Quantifoil R1.2/1.3 holey grids were covered with a thin layer of
continuous carbon film one day before use. Pore induction on
cryoEM grids followed the same procedure as for negative-stain EM, except that
the final staining step was omitted. About 1.5 µl of low-pH buffer was left on the
grids before they were transferred into an FEI Vitrobot Mark IV. The grids were
then blotted and flash-frozen in liquid ethane in the Vitrobot at 100% humidity.
The frozen grids were stored in liquid nitrogen before use.

Negative-stain EM micrographs were acquired with Leginon automation software
and a TIETZ F415MP 16-megapixel CCD camera at ×68,027 magnification in an
FEI Tecnai F20 electron microscope operated at 200 kV. The micrographs were
saved with 2× binning to yield a pixel size of 4.4 Å.
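
The quoted pixel size can be checked from the magnification and binning. The sketch below assumes a physical detector pixel of 15 µm for the CCD camera; that figure is an assumption introduced here for illustration and is not stated in the text.

```python
def specimen_pixel_size_angstrom(detector_pixel_um: float,
                                 magnification: float,
                                 binning: int = 1) -> float:
    """Pixel size at the specimen level, in angstroms.

    detector_pixel_um: physical pixel size of the detector in micrometres.
    magnification:     magnification at the detector plane.
    binning:           number of detector pixels merged along each axis.
    """
    return detector_pixel_um * 1e4 * binning / magnification


# 15 um is an assumed detector pixel size, not a value given in the Methods.
neg_stain = specimen_pixel_size_angstrom(15.0, 68027, binning=2)
print(f"negative-stain pixel size: {neg_stain:.2f} A per pixel")
```

With that assumption the result agrees with the 4.4 Å pixel size reported for the binned micrographs.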

Frozen-hydrated cryoEM grids were loaded into an FEI Titan Krios electron
microscope operated at 300 kV for automated image acquisition with Leginon.
CryoEM micrographs were recorded on a Gatan K2 Summit direct electron detection
camera using the electron counting mode at ×22,500 nominal magnification
(calibrated pixel size of 1.28 Å on the sample level) and defocus values ranging
from −1.8 to −5.1 µm. The dose rate on the camera was set to about 8 electrons
per pixel per second. The total exposure time was 8 s, fractionated into 32 frames
of subimages with 0.25 s exposure time for each frame. Frame images were aligned
and averaged for correction of beam-induced drift using the GPU-accelerated
program from Y. Cheng’s laboratory. The average images from all frames were
used for defocus determination and particle picking, and those from the first 16
frames (corresponding to about 20 electrons per square ångström total dose on
sample) were used for two- and three-dimensional image classifications. In total,
12,416 micrographs were taken in a continuous session. The best 7,062 micrographs
were selected for the following in-depth data processing.
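
The dose figures quoted in the Methods follow from the dose rate, frame time and pixel size. A minimal arithmetic sketch, assuming that one camera pixel corresponds to one calibrated 1.28 Å specimen pixel and that no electrons are lost in counting:

```python
DOSE_RATE = 8.0      # electrons per pixel per second on the camera
PIXEL_SIZE = 1.28    # angstroms per pixel at the specimen level
FRAME_TIME = 0.25    # seconds of exposure per frame


def dose_per_area(n_frames: int) -> float:
    """Accumulated dose in electrons per square angstrom over n_frames frames."""
    electrons_per_pixel = DOSE_RATE * FRAME_TIME * n_frames
    return electrons_per_pixel / PIXEL_SIZE ** 2


total = dose_per_area(32)                    # full 8 s exposure
first_16 = dose_per_area(16)                 # frames used for classification
frames_3_to_27 = dose_per_area(27 - 3 + 1)   # 25 frames used for 'shiny' particles

print(f"all 32 frames:  {total:.1f} e/A^2")
print(f"first 16:       {first_16:.1f} e/A^2")
print(f"frames 3 to 27: {frames_3_to_27:.1f} e/A^2")
```

These values round to the 39, about 20 and about 30 electrons per square ångström reported for the full exposure, the classification subset and the frames used in the final reconstruction.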
Image processing. For negative-stain EM single particle reconstruction, 140,775
particles were picked from 1,115 negative-stain EM micrographs using the batchboxer
program of EMAN. Particles were windowed out in boxes of 96 pixels × 96 pixels.
The defocus value of each micrograph was determined by CTFFIND and particles
were corrected for contrast transfer function (CTF) by phase-flipping with the
corresponding defocus and astigmatism values using Bsoft. An initial model was
generated using the startcsym program of EMAN. The refinement was then performed
with sevenfold symmetry using EMAN.

For cryoEM single particle reconstruction, 21,200 particles (256 pixels × 256
pixels) were initially picked by hand from 1,928 micrographs and subjected to
auto-refinement by RELION using the negative-stain EM map obtained above
as the initial model. The resulting cryoEM map (approximately 4-Å resolution)
was used to generate projections that then served as templates to pick 259,719
particles from all of the 7,062 micrographs using the batchboxer of EMAN. In our
procedure, the defocus values of the micrographs were determined by CTFFIND
and particles were corrected for CTF by phase-flipping using Bsoft. The particles
were processed with two- and three-dimensional classifications using the recommended
procedures of RELION
(http://www2.mrc-lmb.cam.ac.uk/relion/index.php/Recommended_procedures).
Two-dimensional class averages and three-dimensional
class reconstructions were inspected and those without high-resolution and interpretable
features were considered ‘bad’ classes. Particles contributing to the
bad classes were discarded. The remaining 60,455 particles were selected for the
final structure refinement. C7 symmetry was applied throughout the three-dimensional
classification and three-dimensional auto-refinement.
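
Imposing C7 symmetry means treating the map as invariant under rotations by multiples of 360°/7 (about 51.4°) around the pore axis. The following sketch only illustrates that operation on a single point; the choice of z as the symmetry axis and the example coordinates are hypothetical, and this is not how RELION applies symmetry internally.

```python
import math


def cn_copies(x: float, y: float, z: float, n: int = 7):
    """Return the n symmetry-related copies of a point under Cn symmetry,
    taking z as the symmetry axis (an assumption for this sketch)."""
    copies = []
    for k in range(n):
        angle = 2.0 * math.pi * k / n
        copies.append((x * math.cos(angle) - y * math.sin(angle),
                       x * math.sin(angle) + y * math.cos(angle),
                       z))
    return copies


# Hypothetical position (in angstroms) of one atom in one protomer.
copies = cn_copies(10.0, 0.0, 5.0)
for k, (cx, cy, cz) in enumerate(copies):
    print(f"protomer {k + 1}: ({cx:7.3f}, {cy:7.3f}, {cz:7.3f})")
```

Each copy keeps the same distance from the axis and the same height, which is why a single protomer is enough to describe the heptamer once the symmetry is imposed.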

To maximize usable signals from the frame images acquired with the K2 Summit
camera, we used the resolution- and dose-dependent model of radiation damage
recently introduced in RELION-1.3 in the following steps. First, the particle images
averaged from all 32 frames with whole-image drift correction were used for a
preliminary three-dimensional auto-refinement. Second, particle images from individual
frames were used to calculate translational alignments for the particle-based
drift correction. A running average of seven frames, a standard deviation of one
pixel, and fitting of linear tracks through the translations for all running averages
were used for the optimal translational alignment following the suggested protocol
of RELION. Last, particle images from frame 3 to frame 27 (~30 electrons
per square ångström total dose on sample) were translated using the above optimal
alignment and weighted with different B-factors as estimated from the single-frame
reconstructions to generate optimal ‘shiny’ average images. Application of this
procedure to the above selected particles yielded 60,455 ‘shiny’ particles.

These ‘shiny’ particles were then subjected to three-dimensional auto-refinement
in RELION to generate the final cryoEM map. RELION post-processing with a
soft auto-mask estimated a resolution of 2.9 Å by the ‘gold standard’ Fourier
shell correlation (FSC) at 0.143 criterion and a B-factor of −95 Å², and the post-processing
without any mask reported a resolution of 3.3 Å. The accuracies of rotation
and translation reported by RELION three-dimensional auto-refinement were
1.67° and 1.01 Å. For visualization and atomic model building, the cryoEM map
was sharpened and low-pass filtered by RELION post-processing using the above-mentioned
B-factor and resolution. Local resolution was calculated by ResMap
using the two cryoEM maps independently refined from halves of data.
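
Reading a resolution off an FSC curve at the 0.143 criterion amounts to finding the spatial frequency at which the curve first falls below 0.143. The sketch below shows only that threshold step, with linear interpolation between shells; the curve values are hypothetical, and the mask correction by phase randomization performed in RELION post-processing is not reproduced here.

```python
def resolution_at_threshold(freqs, fsc, threshold=0.143):
    """Return the resolution (1/frequency, in angstroms) at which the FSC
    first drops below the threshold, or None if it never does.

    freqs: spatial frequencies in 1/angstrom, in increasing order.
    fsc:   FSC value for each resolution shell.
    """
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            # Linear interpolation between the two shells around the crossing.
            frac = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            f_cross = freqs[i - 1] + frac * (freqs[i] - freqs[i - 1])
            return 1.0 / f_cross
    return None


# Hypothetical FSC curve for illustration; not the curve reported in this work.
freqs = [0.10, 0.20, 0.30, 0.34, 0.36]
fsc = [0.99, 0.95, 0.50, 0.20, 0.10]
res = resolution_at_threshold(freqs, fsc)
print(f"estimated resolution: {res:.2f} A")
```

The same function applied to a masked and an unmasked curve would give two different estimates, in the same way that the text reports different values with and without the auto-mask.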

Three-dimensional classification and auto-refinement also identified a subset of
21,632 particles that led to a cryoEM map at 3.6-Å resolution, which was in the
pore state but lacked the density corresponding to the 14-stranded β-barrel: that is,
it only had the corolla and calyx of the PA pore.
Atomic model building and refinement. De novo atomic model building of the
PA pore except its domain 4 was performed on the cryoEM map at 2.9-Å resolution
using Coot. The coarse model was then refined using PHENIX in a pseudo-crystallographic
manner. Note that this procedure only improved the atomic model
and did not modify the cryoEM map. Briefly, the cryoEM map was put into an artificial
crystal lattice to calculate its structure factor using the em_map_to_hkl.inp
utility program in CNS. The amplitudes and phases of the structure factor were
used as pseudo-experimental diffraction data for model refinement in PHENIX.
The restraints of Ramachandran, secondary structure, and non-crystallographic
symmetry were used in the refinement.

The cryoEM maps and atomic models were visualized using UCSF Chimera or
PyMOL. The available crystal structures of PA83 monomer (for example, PDB
accession numbers 1ACC and 3TEW) and PA63 heptameric prepore (PDB accession
number 1TZO) are highly similar to each other; therefore 3TEW, which has
the highest resolution among them, was used for structural comparison with our
cryoEM structure of the PA pore in most situations and 1TZO was used when the
inter-protomer interaction or domain movement was considered.

Extended Data Figure 1 | Negative-stain and cryoEM of the PA pore.
a, Negative-stain EM micrograph of PA pore particles. Some representative
top-view and side-view particles are selected with circles and squares,
respectively. b, A full-size drift-corrected cryoEM micrograph (3,710 pixels ×
3,710 pixels) of PA pore particles acquired with a Gatan K2 Summit direct
electron detection camera, at 300 kV accelerating voltage, −2.7 µm defocus, and
a total dose of 39 electrons per square ångström. Some representative side-view
particles of PA pore are indicated by arrows. c, Power spectrum of the cryoEM
micrograph in b. d, Representative cryoEM two-dimensional class averages
of particles at different orientations. e–h, Superimposition of representative
regions of the cryoEM map (mesh) with the atomic model (stick), including
α-helix (e), β-strand (f), loop (g), and Ca2+ ions (green spheres in h).


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 2 | Resolution estimation of the cryoEM single
particle reconstruction of the PA pore. a, ‘Gold standard’ FSC between two
independently refined maps with an auto-mask that was corrected by phase
randomization. The resolution was estimated by the ‘gold standard’ FSC at
0.143 criterion. b, FSC of the final atomic model versus the final cryoEM map
(black); of a model refined in the first of the two independent maps used for the
‘gold standard’ FSC versus the same map (red) and versus the second
independent map (blue). c, Surface view and cut-through view of the
unsharpened cryoEM map coloured by local resolution estimated by ResMap.
d, Euler angle distribution of all particles used for the final three-dimensional
reconstruction.

Extended Data Figure 3 | Fitting of domain 4 into the cryoEM map of PA
pore. a, The crystal structure of domain 4 (purple ribbons) from PA prepore
(PDB accession number 1TZO) fits the cryoEM map (grey surface) of PA
pore with a good agreement (cross-correlation coefficient: 0.91).
b, Translocation of domain 4 during the prepore to pore conversion. Domain 4
is shifted inwards as a rigid body by 4 Å from the prepore conformation
(dark cyan) to the pore conformation (purple).

Extended Data Figure 4 | Comparison of the structures of the PA pore,
α-haemolysin (PDB accession number 7AHL), and Vibrio cholerae
cytolysin (PDB accession number 3O44). These three complexes are in their
membrane-inserted forms. The approximate position of the lipid bilayer is
illustrated with grey shade. The host cell receptor (TEM8 or CMG2) of PA that
binds to domain 4 (grey ribbons) is schematically illustrated.

Extended Data Figure 5 | Assembling of the Φ-clamp in the PA pore.
a, Comparison of the Φ-clamp in the PA pore with its corresponding region in
the PA prepore. The residues Asp426 and Phe427 missing in the crystal
structure of the PA63 prepore (PDB accession number 1TZO) are modelled
based on the crystal structure of the PA83 monomer (PDB accession number

2β7–2β8 and 2β10–2β11 loops are coloured in cyan and orange, respectively.
The hydrogen bonds between Ser428, Asn399, and Lys397′, which form a chain
tethering the 2β7–2β8 and 2β10–2β11 loops together, are depicted with
dashed lines.

Extended Data Figure 6 | Conformational changes of domain 2c between
the PA prepore and the PA pore. a, Superimposition of domains 2c of the PA
pore and the crystal structure of the PA heptameric prepore (PDB accession
number 1TZO). The dashed boxes highlight the conformational difference
of the 2β10–2β11 loop between these two structures. Residues Asp426 and
Phe427 are not resolved in the crystal structure of PA prepore because of their
flexibility. b, Superimposition of domains 2c of the PA pore and the PA
octameric prepore (PDB accession number 3HVD). Note the similarity of
the 2β10–2β11 loop (dashed box) between these two structures.
c, Superimposition of domains 2c of the PA pore and the PA prepore
(monomer; PDB accession number 3TEW). d–f, Close-up views of the
rearrangements of the 2β10–2β11 loop (d), the 2β7–2β8 loop (e), and the
2β5–2β6 and 2β12–2β13 loops (f) during the conversion from prepore to pore.
Val377 and Leu378 of the 2β5–2β6 loop and Val455 and Tyr456 of the
2β12–2β13 loop are flipped upside down to obtain the β-strand conformation in
the PA pore, leading to an extension of the adjacent β-strands (f). CryoEM densities
(mesh) corresponding to these loops are displayed to the right of d–f with
atomic models superimposed, showing unambiguous atomic modelling.
The superimpositions of the cryoEM map and the atomic model are shown in
views different from the left panels for clarity. The conformational changes
of these loops result in a more compact domain 2c with a decrease of 567 Å² in
its surface area.

Extended Data Figure 7 | Compact assembly of the PA pore. a, Surface views
of two neighbouring protomers (the same domains in the two protomers are in
different shades of the same colours) of the PA pore and the PA prepore
(PDB accession number 1TZO) visualized from inside the PA heptamer.
Domains 1′, 2c, 2s, and 3 are coloured differently and domain 4 is not shown.
The inter-protomer interface in the PA prepore is largely formed by domains 1′
and 3, and domain 2 only contributes to this interface by its membrane
insertion loop and carboxy (C)-terminal region (213, 2B, 4, and 205). In the PA 
pore, the convergence of domain 2c creates an inter-protomer interface without
any gap, with an increase of interface area on domain 2c from 1,247 Å² of
the PA prepore to 2,106 Å² of the PA pore as calculated using PISA
(http://www.ebi.ac.uk/pdbe/pisa/). Additionally, formation of the β-barrel also leads to
extensive contacts, creating a new interface area of 1,195 Å² between two
protomers. b, Schematic of the conversion of 2β2, 2β3, 2α1, and the membrane
insertion loop of the PA prepore to 28,, and 25, of the PA pore. Hydrogen 
bonds between 2β2 and 2β3, which are depicted with dashed lines, are
maintained during the conversion. By contrast, 2α1 and the connecting loops
have to be fully unfolded and converted into β-strands that collectively
assemble the β-barrel of the PA pore. Although the detailed events of β-barrel
formation are not yet clear, it is likely that assembly starts from the top in a
zipper-like manner. A favourable scenario is that the convergence of domain
2c would place the top ends of the β-strands close to each other to form a
short β-barrel, which could extend by pulling more residues together via
formation of ordered hydrogen bonds until it reaches the bottom end. It is less
favourable that the assembly starts from other regions because disordered
hydrogen bonds and hydrophobic interactions could generate many
non-productive, possibly irreversible pairings between strands.

Extended Data Figure 8 | CryoEM single-particle reconstruction of a subset
of particles lacking the 14-stranded β-barrel. a, Surface views of the cryoEM
map lacking the β-barrel. The map is unsharpened and has a resolution of
3.6 Å. b, Cross-section side view of superimposition of the unsharpened
cryoEM map with the atomic model of the intact PA pore, showing the cryoEM
map has the same conformation as the PA pore except for the absence of the
14-stranded β-barrel. c, ‘Gold standard’ FSC (with an auto-mask that was
corrected by phase randomization) between two independently refined maps.
d, Top view of the Φ-clamp region of the cryoEM map (mesh) lacking the
β-barrel superimposed with the atomic model of the PA pore (ribbons),
showing correct assembling of the Φ-clamp in the cryoEM map.

Extended Data Table 1 | Data collection and structure refinement statistics

Data collection
  Particles                                         60,455
  Pixel size (Å)                                    1.28
  Defocus range (µm)                                −1.8 to −5.1
  Voltage (kV)                                      300
  Total electron dose (e/Å²)                        39
  Electron dose used in the final
    reconstruction (e/Å²)                           30

Refinement
  Resolution (Å)                                    2.9
  Map sharpening B-factor (Å²)                      −95
  Average B-factor (Å²)                             81

R.m.s. deviations
  Bond lengths (Å)                                  0.009
  Bond angles (°)                                   1.083

MEDICAL RESEARCH


Subject to reflection 


Most scientists who study disease carry out their research with an eye to treating others — 
but a few have only to look at their own bodies to feel the need for a cure. 


BY VIRGINIA GEWIN 


Lukas Wartman always wanted to be a physician-scientist, but he never
expected to be a research subject. Nor
did he anticipate that his postdoc adviser
would end up sifting through his genome for
clues to treatment.

In 2003, during his final year of medical
school, Wartman was diagnosed with the blood
cancer acute lymphoblastic leukaemia (ALL).
He went into remission after chemotherapy,
relapsed one year into his clinical fellowship in
oncology and then recovered following a bone-marrow
transplant. In 2008, he started postdoctoral
research in cancer genetics at Washington
University in St Louis (WUSTL), Missouri.


At the time, his adviser, cancer researcher 
Timothy Ley, was carrying out the first whole- 
genome sequencing study of acute myeloid 
leukaemia (AML). When Wartman relapsed 
again in 2011, Ley and his colleagues sequenced 
his cancer genome, too. They found that a 
certain gene was overactive, which pointed 
them to sunitinib, a drug for kidney cancer that 
is known to reduce the gene’s activity.

Despite having to cope with complications 
from a second bone-marrow transplant, 
Wartman continues to conduct cancer research, 
emboldened by his own illness. He focuses on 
AML, which produces symptoms similar to 
those of his cancer, but has a different genetic 
cause. Last December, Wartman published 
preliminary data on the suitability of a drug for
both AML and ALL in the American Society of
Hematology journal Blood (L. Wartman et al.
Blood 124, 5292; 2014).

Scientists such as Wartman, who research 
diseases that affect themselves or their loved 
ones, occupy a curious niche in the scientific 
enterprise. Their experiences can offer unique 
research insight, garner media attention and 
provide valuable connections to patient groups. 
And they are highly motivated to find a cure. 

But their jobs are also fraught with the 
emotional — and sometimes ethical — 
challenges that can arise when researching 
something that resonates deeply on a personal 
level. Several of Wartman’s peers and mentors
encouraged him to leave oncology to free himself
from thinking about cancer all the time.

Those who come to know a disease as
both patient and investigator can take steps to 
preserve their emotional health and research 
objectivity. These scientists need to be mindful 
of their motivations and realistic about their 
goals — and foster a broad and deep support 
network for times when research findings hit 
too close to home. 


SOUL MEETS BODY 

When a personal connection is involved, the
drive to improve research into and treatment
of a disease can be especially powerful. Geneticist
Angela Christiano knows this first-hand:
she jumped into her line of research after being
diagnosed with an autoimmune disease called
alopecia areata, which causes hair to fall out.
As a postdoc, she had worked on dermatological
diseases, but she began to look for another
research focus when she moved into a tenure-track
position at Columbia University in New
York City. One day in 1996, her hairdresser
pointed out a bald spot on the back of her head.
After her diagnosis, she learned that frustratingly
little was known about her condition.

And so she began a decade-long journey 
with patient-advocacy groups to establish a 
registry that now contains 3,000 serum and cell 
samples for genetics work. In 2008, she conducted
a genome-wide association study and
found seven genes linked to the disease. She 
also uncovered similarities between alopecia 
areata and other autoimmune disorders such as 
type 1 diabetes and coeliac disease. 

Christiano launched clinical trials to test 
whether two drugs — one for rheumatoid 
arthritis and another for a bone-marrow 
disorder — could also combat alopecia areata. 
Last year, she published results showing that 
three patients treated with one of these drugs 
achieved near-full hair regrowth (A. Christiano 
et al. Nature Med. 20, 1043-1049; 2014). Her 
emotional bond with the patient community 
fuels her work. “I give talks every year to the
National Alopecia Areata Foundation patient
conference. The year that I presented the
genome-wide association study, and when
I announced drug candidates for clinical trial
four years later, they gave me a standing
ovation,” she says. “We were all sobbing.”

A deep personal drive might be just what
is needed in an understudied, underfunded
field. Leonard Jason, a psychologist who
directs clinical training, found his diagnosis
of chronic fatigue syndrome (CFS) in 1990 to
be life-altering, both personally and professionally.
After an 18-month leave of absence
from DePaul University in Chicago, Illinois,
to recuperate, he delved into the CFS literature
and realized a need for a better definition of
the illness, a less-stigmatizing name and more-robust
research on prevalence.

[Photo: Lukas Wartman studies the genetics of cancer.]

The field was riddled with questions —
indeed, many physicians dispute whether CFS
is even a genuine illness. “A number of folks
had their careers destroyed by coming into this
area,” Jason says. Cognizant of potential career
fallout and his susceptibility to overexertion, he 
focused on what the field needed most and what 
he could realistically achieve. He spent a decade 
on epidemiological studies that expanded esti- 
mates of the US patient population from 20,000 
to 900,000. His work also showed that medical 
interns are more dismissive of the term chronic 
fatigue syndrome than of the more physiologi- 
cally based name, myalgic encephalomyelitis 
(L. Jason et al. Am. J. Community Psychol. 30, 
133-148; 2002). “The key is to keep the focus 
on small wins,” he says. “I’ve been able to work 
with patients, researchers and government 
officials for over two decades on a variety of 
topics — from changing the name, to searching 
for biological markers.”

Jason might be one of only a few people with 
CFS conducting research on the condition, 
but that is not the case in other areas. “In type 
1 diabetes research, there is an overrepresentation
of people with the disease,” says Timothy
Tree, an immunobiologist at King’s College 
London. And Tree is one of them. 

Tree is driven by one specific question — 
why did he, and not his brothers, inherit the 
disease from their father? His need to answer 
that question, as well as whether he could pass 
it on to his children, keeps him going in the lab. 
“When the experiment goes wrong or funding 
gets turned down, having the disease gives me 
a reason to stay in it for the long haul,” he says. 

Researchers in his position can offer 
practical perspectives, Tree says. For example, 
if a colleague suggests an idea for a therapy, he
can give an informed opinion on how likely
patients would be to comply. “It gives me a
different kind of objectivity,” he says.

Yet some people are concerned that such 
personal significance could compromise 
objectivity by introducing bias. “Bias can oper- 
ate at a subconscious level,” says David Resnik, 
a bioethicist at the US National Institute of 
Environmental Health Sciences in Research 
Triangle Park, North Carolina. And the results
can potentially skew the work, he adds. “It 
might lead a researcher to make interpreta- 
tions of data or study designs differently.” 

Rebecca Dresser, a research ethicist at 
WUSTL, says that researchers who study 
their own or family members’ diseases can 
take precautions to safeguard against bias or 
perceptions of bias. “Consult with people who 
don’t have the same personal perspective — 
and get them on the team reviewing grants and 
research protocols,” she says.

But researchers such as Christiano and Tree 
say that they can put scientific integrity ahead 
of hope. If anything, Christiano says, having 
alopecia areata made her more cautious and 
conservative. “I’m the most demanding one 
on the team. I want to be triple sure about any 
findings before we go forward.” And Tree notes 
that his experience with diabetes has taught 
him how important it is not to oversell positive 
results. “As a patient, I know that people hang 
on to every word you say for hope.”


A FINE LINE

Balancing work and personal life is always 
important — more so when that work can have 
an impact on a scientist’s emotional state as well 
as productivity. “Having a bit of distance — 
between my own problems and the larger-scale 
problems we were trying to understand in the 
lab — definitely made things easier for me,” says
Wartman. Of course, he does not directly study 
his own genome. “Tim and I sat down and had 
a serious discussion about whether or not even 
doing the genomics of AML would be too psychologically
heavy for me to take on,” he says.

Wartman’s experience has strongly influenced
his priorities. He benefited from the
sequencing of his cancer genome, and feels a
need to help others to gain similar insight. So
he chose to devote his energy to the genomics
tumour ward at WUSTL. Each month, he and his
colleagues invite cancer-care professionals to
present cases. Together, they decide which dis- 
eases merit genome sequencing, bridging the 
gap between research and discovery genomics. 
“I spend a lot of my off-hours getting this up and
running,” he says.

But sometimes the pressure can become too 
great. Michael Dodd was on track to conduct 
his postgraduate research at the University of 
Oxford, UK, on a condition that afflicts his 
father called hypertrophic cardiomyopathy, 
in which the heart muscle thickens. He did 
not know that he, too, carried the underlying 
genetic mutation, until a year into his PhD 
programme when a genetic test came back 
positive. “It was quite a shock to find out, 
especially since, to this day, I don't present any 
symptoms,” he says. At one point, his supervisor,
Hugh Watkins, doubled as his physician.


Dodd chose to shift his research focus 
elsewhere. “I sometimes found it weird 
to be in the lab,” he says. He was one of 
several patients who had the mutation, yet 
no symptoms, and so had MRI scans in 
their lab. “It was weird to see a bar graph, 
knowing I'm one of the points,” he says. 

The research could be emotionally taxing. 
“It would feel odd to work on, for example,
a mouse with the same genetic mutation 
as me, and wonder if I would respond 
similarly,” he says. But he did want to keep 
working on the heart, so he is now a postdoc 
studying the cardiac effects of diabetes, a 
disease that his grandfather had. 


SPOTLIGHT SCARS 

The emotional toll can be especially intense 
when media attention forces the scientist 
into the public eye. Wartman felt the land- 
scape shift after a high-profile piece about 
him appeared in the New York Times in 2012. 
He is happy that patients find his personal 
perspective helpful, but regrets that the deci- 
sion to share his story no longer rests with 
him. “It’s still not the easiest topic for me to 
talk about,” he says. “The last time I relapsed,
I came close to dying. To rehash that on a
regular basis is emotionally draining.”

Media attention can change one’s entire 
research career. Kay Redfield Jamison, a 
clinical psychiatrist and founder of a clinic 
for mood disorders at the University of 
California, Los Angeles, channelled 
her struggles with bipolar disorder into 
research on the illness's wide range of effects 
— from enhanced creativity to a high risk 
of suicide. But when she wrote her autobi- 
ography in 1995, entitled An Unquiet Mind: 
A Memoir of Moods and Madness, she knew 
that her professional life would never be the 
same. She gave up her clinical practice. “You 
can’t say that you’ve been psychotic and 
nearly died by suicide and expect people to 
look at you the same way,” she says.

Now at Johns Hopkins University in 
Baltimore, Maryland, Jamison focuses on 
writing and public speaking. She credits a 
network of supportive friends and colleagues 
for helping her to navigate her career ups 
and downs. “Becoming a poster child for an 
illness is draining,” she says. “It becomes a
disturbing part of your identity.” Still, it was 
worth it to reach others who were suffering. 
“That’s what good comes out of it.”

At the end of the day, that desire to aid 
others motivates many researchers to 
continue their work even though their own 
health is poor. “Leukaemia disrupted my 
career and goals and was a huge setback in 
my life,” Wartman says. “At the same time, if
I can turn my own struggle into a story that
helps other people, that has value.”


Virginia Gewin is a freelance writer in 
Portland, Oregon. 


TURNING POINT 


Roberto Kolter 


Roberto Kolter set up his microbiology 
laboratory at Harvard Medical School in 
Boston, Massachusetts, in 1983. Postdocs 
worldwide hope to join his lab because of his 
career-targeted training philosophy, but with 
rare exceptions, he brings in only those who 
already have a fellowship. 


Why do you accept postdocs only if they have 
their own funding? 

I focus on those whom I believe have a fan- 
tastic chance of getting their own funding as a 
principal investigator. I think it’s unfair for me 
to interview those who have very little chance 
of getting their own funding, considering how 
competitive the academic job market is and 
how important it is to show independence. 


What does your laboratory focus on? 

I let the postdocs explore what they want to 
explore, as long as it is within the sphere of my 
interest. I've worked on starvation physiology, 
biofilms, signalling, experimental evolution, 
antibiotics and many other subjects. 


Describe your training philosophy. 

I train people to go on into academia, industry,
the corporate world or whatever they want to 
go into. We need to give them the experience 
that they require, including learning how to 
teach and learning how to manage. Postdocs 
are not just there to come to the lab so that 
principal investigators can get their next grant. 


What stands out when you look at applications?

I have learned that networking works very, 
very well. If I know who trained that indi- 
vidual, and I know and respect them, then I'll 
know a lot about how this postdoc will work 
in the lab. But that does not mean that if I don’t
know the mentor I will close the door to the 
postdoc. They need to have also done their 
homework — they need to know how I train 
people and how they think they would fit in. 


When have you made exceptions? 

There are one or two cases where I was com- 
pletely sure that they would get a fellowship, and 
they didn't. But by then I had gotten so excited 
about the project we had co-developed that I 
chose to support them from my own funds. 


How does your lab develop a research project?

The ideas often emerge from conversations 
that start about 18 months before the postdoc 
comes to work with me. It has almost always 
been my policy that incoming postdocs build 
their research projects and are free to take the 
project with them once they leave, to help
them to set up their own lab. That gives the 
postdocs who are leaving a good opportunity 
to establish themselves without having to com- 
pete with me and the people in my lab. 


What careers do your postdocs pursue? 

About half the 100 or more postdocs that have 
gone through my lab hold full-time academic 
jobs, of which running a research lab is a big 
component. Many people whom I take on 
as postdocs want a job in the biotechnology 
arena. The other 50% are dominated by those 
who choose to join a company. Those can 
range from start-up biotech companies to very 
well-established pharmaceutical or chemical 
companies. Others lead research groups at 
institutes or government labs, work as research 
associates, teach science or do other science- 
related work. Only two have left science. 


Do they get permanent positions right away?

No one who has come through my lab has had 
to leave science because they could not get a 
job. Personally, I believe that I have failed a 
postdoc if I take them into my lab and they 
cannot get a job that they love when they leave. 
That usually means that they have to go on to 
do a second postdoc. There have been very 
few such individuals — fewer than five, in the 
32 years I have had my own lab. So overall I 
rate my success rate in helping postdocs get 
their first job at about 90%. 


What do you see as the role of a postdoc? 

The meaning of postdoctoral training has been 
lost in today’s scientific community. As men- 
tors, we need to really reconsider what we are 
training postdocs for. And that’s just it: it’s a 
training period, not a job.


INTERVIEW BY JULIE GOULD 
SCIENCE FICTION


AN EXCERPT FROM 
DYING FOR DUMMIES (2020) 


A guide to the next stage of your life. 


BY NORMAN SPINRAD 


INTRODUCTION 

It used to be that choosing how to die involved 
no complicated menu of alternatives. You 
waited for it to happen or you took your own 
life — and that was it. And if you didn’t want 
to decide on how to dispose of the material 
remains, you didn't have to — you could 
just leave that choice to your next of kin. But 
scientific advances have created more inter- 
esting and much more positive post-mortem 
modes to choose from. That's the good news. 

The bad news is that making the choice 
requires sophisticated technological knowl- 
edge that is beyond what the average person 
is likely to have. You must understand that 
the choice is, in effect, the most crucial one 
you have to make, not merely in your life but 
potentially for ever. And you need to realize 
that the consequences will not be assured, 
that post-mortem existence is not a matter 
of certainty, but a bet on the odds that each 
mode currently seems to offer. 

So the purpose of Dying For Dummies is to 
allow the public at large to make this choice 
in an enlightened and informed manner by 
explaining those odds and the costs of each 
post-mortem mode of existence. 


CHAPTER ONE: THE CURRENT POSSIBILITIES

There are currently three post-mortem 
modes to choose from: 


1) Cryogenic freezing of either the whole 
body or just the head. This is the most expen- 
sive choice, because the cryogenic state must 
be maintained indefinitely until the technol- 
ogy to revive your body or at least your brain 
is developed, if ever. In addition, it would be 
best to be frozen before you are legally dead, 
but that is not legal in all jurisdictions. So, 
given the expense and the odds, this cannot 
be recommended for most people. 


2) Creation of a cloned body with your own 
genome and a brain transfer from your 
original body into it. This surgery is already 
possible and is favoured by those who believe 
that consciousness or the so-called ‘soul’ is 
an artefact of the unique individual cerebral 
meatware. Although initially more expensive 
than cryogenic freezing, it is cheaper in the 
medium run but, as with any major operation, 
not without significant risk — and in the
long run it is not a permanent solution, as the 
cloned body will eventually wear out too, and 
the procedure will have to be repeated. 

Also, the choice of retaining the original 
cerebral meatware would seem to mean that 
the brain will continue to deteriorate in the 
cloned body or sequence of cloned bodies, 
although this has yet to be proved. 

Still, this would be the obvious choice for 
those with religious or philosophical belief 
systems contending that if the ‘soul’, i.e. the
consciousness software, is not running on the
original meatware, ‘you’ are not really there.


3) Upload the instantaneous hologram 
of your consciousness and your memory 
data at or before the moment of death 
into a material matrix more robust than 
protoplasm — that is, computer hardware. 
Currently this seems to work to the point
at which the ‘entity’ in the computer can [...]
of the materially deceased speaks with the
uploaded entity in the computer and with 
a synthesized emulation without being told 
which is which and correctly identifies the 
true post-mortem consciousness about 70% 
of the time, significantly above random. 

However, the technology to download 
the consciousness software in the computer 
to a cloned brain in a cloned body has not 
yet been perfected beyond experiments 
with simplified backups and with equivocal 
results. Some of the software entities in the 
computer hardware declare that they don't 
even want to be downloaded into disgust- 
ing squishy protoplasm and are more than 
content to wait until metal bodies become 
available. And some of them don’t even 
want those ‘bodies’ to emulate protoplas- 
mic human bodies, and instead are eager to 
become aeroplanes, spaceships, submarines 
and even more outré customized ‘bodies’.

This choice would seem to be the best one 
for those of modest means, as the upload to 
computer memory is the cheapest option, 
but it is probably only for the adventurous 
and those who believe that their essential 
being is the software of their consciousness 
independent of the meatware or hardware it 
is running on. 

However, if consciousness is data and soft- 
ware independent of the meatware or hard- 
ware it is running on, it can be duplicated 
and backed up. But if it can be duplicated 
once, it can be duplicated many times and, 
once the technology is perfected, down- 
loaded into multiple bodies, metallic crea- 
tions, cloned protoplasm, or all of the above. 

The practical advantages would be obvious. 
‘You’ can be all the whatevers you want to be
and at the same time. ‘You’ can be a whole
football team, or a whole board of directors,
or a whole symphony orchestra, or the whole
population and machineries of an orbital 
habitat or otherwise generational starship. 

So the question is: 


CHAPTER TWO: WHICH IS ‘YOU’? 


... to be continued.


Norman Spinrad has been publishing novels 
in English for an actual half-century. His 
latest publication in English is the pamphlet 
Raising Hell. His latest novel has just been 
published in French as Police Du Peuple. 